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Recommended Books and Resources 

• M. Peskin and D. Schroeder, An Introduction to Quantum Field Theory 

This is a very clear and comprehensive book, covering everything in this course at the 
right level. It will also cover everything in the "Advanced Quantum Field Theory" 
course, much of the "Standard Model" course, and will serve you well if you go on to 
do research. To a large extent, our course will follow the first section of this book. 

There is a vast array of further Quantum Field Theory texts, many of them with 
redeeming features. Here I mention a few very different ones. 

• S. Weinberg, The Quantum Theory of Fields, Vol 1 

This is the first in a three volume series by one of the masters of quantum field theory. 
It takes a unique route to through the subject, focussing initially on particles rather 
than fields. The second volume covers material lectured in "AQFT" . 

• L. Ryder, Quantum Field Theory 

This elementary text has a nice discussion of much of the material in this course. 

• A. Zee, Quantum Field Theory in a Nutshell 

This is charming book, where emphasis is placed on physical understanding and the 
author isn't afraid to hide the ugly truth when necessary. It contains many gems. 

• M Srednicki, Quantum Field Theory 

A very clear and well written introduction to the subject. Both this book and Zee's 
focus on the path integral approach, rather than canonical quantization that we develop 
in this course. 



There are also resources available on the web. Some particularly good ones are listed 
on the course webpage: http://www.damtp.cam.ac.uk/user/tong/qft.html 
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0. Introduction 



"There are no real one-particle systems in nature, not even few-particle 
systems. The existence of virtual pairs and of pair fluctuations shows that 
the days of fixed particle numbers are over." 

Viki Weisskopf 

The concept of wave-particle duality tells us that the properties of electrons and 
photons are fundamentally very similar. Despite obvious differences in their mass and 
charge, under the right circumstances both suffer wave-like diffraction and both can 
pack a particle-like punch. 

Yet the appearance of these objects in classical physics is very different. Electrons 
and other matter particles are postulated to be elementary constituents of Nature. In 
contrast, light is a derived concept: it arises as a ripple of the electromagnetic field. If 
photons and particles are truely to be placed on equal footing, how should we reconcile 
this difference in the quantum world? Should we view the particle as fundamental, 
with the electromagnetic field arising only in some classical limit from a collection of 
quantum photons? Or should we instead view the field as fundamental, with the photon 
appearing only when we correctly treat the field in a manner consistent with quantum 
theory? And, if this latter view is correct, should we also introduce an "electron field", 
whose ripples give rise to particles with mass and charge? But why then didn't Faraday, 
Maxwell and other classical physicists find it useful to introduce the concept of matter 
fields, analogous to the electromagnetic field? 

The purpose of this course is to answer these questions. We shall see that the second 
viewpoint above is the most useful: the field is primary and particles are derived 
concepts, appearing only after quantization. We will show how photons arise from the 
quantization of the electromagnetic field and how massive, charged particles such as 
electrons arise from the quantization of matter fields. We will learn that in order to 
describe the fundamental laws of Nature, we must not only introduce electron fields, 
but also quark fields, neutrino fields, gluon fields, W and Z-boson fields, Higgs fields 
and a whole slew of others. There is a field associated to each type of fundamental 
particle that appears in Nature. 

Why Quantum Field Theory? 

In classical physics, the primary reason for introducing the concept of the field is to 
construct laws of Nature that are local. The old laws of Coulomb and Newton involve 
"action at a distance" . This means that the force felt by an electron (or planet) changes 
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immediately if a distant proton (or star) moves. This situation is philosophically un- 
satisfactory. More importantly, it is also experimentally wrong. The field theories of 
Maxwell and Einstein remedy the situation, with all interactions mediated in a local 
fashion by the field. 

The requirement of locality remains a strong motivation for studying field theories 
in the quantum world. However, there are further reasons for treating the quantum 
field as fundamental 1 . Here I'll give two answers to the question: Why quantum field 
theory? 

Answer 1: Because the combination of quantum mechanics and special relativity 
implies that particle number is not conserved. 

Particles are not indestructible objects, made at the 
beginning of the universe and here for good. They can be 
created and destroyed. They are, in fact, mostly ephemeral 
and fleeting. This experimentally verified fact was first 
predicted by Dirac who understood how relativity implies 
the necessity of anti-particles. An extreme demonstra- 
tion of particle creation is shown in the picture, which 
comes from the Relativistic Heavy Ion Collider (RHIC) at 
Brookhaven, Long Island. This machine crashes gold nu- 
clei together, each containing 197 nucleons. The resulting 
explosion contains up to 10,000 particles, captured here in 
all their beauty by the STAR detector. 

We will review Dirac's argument for anti-particles later in this course, together with 
the better understanding that we get from viewing particles in the framework of quan- 
tum field theory. For now, we'll quickly sketch the circumstances in which we expect 
the number of particles to change. Consider a particle of mass m trapped in a box 
of size L. Heisenberg tells us that the uncertainty in the momentum is Ap > h/L. 
In a relativistic setting, momentum and energy are on an equivalent footing, so we 
should also have an uncertainty in the energy of order AE > hc/L. However, when 
the uncertainty in the energy exceeds AE = 2mc 2 , then we cross the barrier to pop 
particle anti-particle pairs out of the vacuum. We learn that particle-anti-particle pairs 
are expected to be important when a particle of mass m is localized within a distance 
of order 

A = — 

mc 

1 A concise review of the underlying principles and major successes of quantum field theory can be 
found in the article by Frank Wilczek, http://arxiv.org/abs/hep-th/9803075 
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At distances shorter than this, there is a high probability that we will detect particle- 
anti-particle pairs swarming around the original particle that we put in. The distance A 
is called the Compton wavelength. It is always smaller than the de Broglie wavelength 
-W = h/\p\. If you like, the de Broglie wavelength is the distance at which the wavelike 
nature of particles becomes apparent; the Compton wavelength is the distance at which 
the concept of a single pointlike particle breaks down completely 

The presence of a multitude of particles and antiparticles at short distances tells us 
that any attempt to write down a relativistic version of the one-particle Schrodinger 
equation (or, indeed, an equation for any fixed number of particles) is doomed to failure. 
There is no mechanism in standard non-relativistic quantum mechanics to deal with 
changes in the particle number. Indeed, any attempt to naively construct a relativistic 
version of the one-particle Schrodinger equation meets with serious problems. (Negative 
probabilities, infinite towers of negative energy states, or a breakdown in causality are 
the common issues that arise). In each case, this failure is telling us that once we 
enter the relativistic regime we need a new formalism in order to treat states with an 
unspecified number of particles. This formalism is quantum field theory (QFT). 

Answer 2: Because all particles of the same type are the same 

This sound rather dumb. But it's not! What I mean by this is that two electrons 
are identical in every way, regardless of where they came from and what they've been 
through. The same is true of every other fundamental particle. Let me illustrate this 
through a rather prosaic story. Suppose we capture a proton from a cosmic ray which 
we identify as coming from a supernova lying 8 billion lightyears away. We compare 
this proton with one freshly minted in a particle accelerator here on Earth. And the 
two are exactly the same! How is this possible? Why aren't there errors in proton 
production? How can two objects, manufactured so far apart in space and time, be 
identical in all respects? One explanation that might be offered is that there's a sea 
of proton "stuff" filling the universe and when we make a proton we somehow dip our 
hand into this stuff and from it mould a proton. Then it's not surprising that protons 
produced in different parts of the universe are identical: they're made of the same stuff. 
It turns out that this is roughly what happens. The "stuff" is the proton field or, if 
you look closely enough, the quark field. 

In fact, there's more to this tale. Being the "same" in the quantum world is not 
like being the "same" in the classical world: quantum particles that are the same are 
truely indistinguishable. Swapping two particles around leaves the state completely 
unchanged — apart from a possible minus sign. This minus sign determines the statis- 
tics of the particle. In quantum mechanics you have to put these statistics in by hand 
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and, to agree with experiment, should choose Bose statistics (no minus sign) for integer 
spin particles, and Fermi statistics (yes minus sign) for half-integer spin particles. In 
quantum field theory, this relationship between spin and statistics is not something 
that you have to put in by hand. Rather, it is a consequence of the framework. 

What is Quantum Field Theory? 

Having told you why QFT is necessary, I should really tell you what it is. The clue is in 
the name: it is the quantization of a classical field, the most familiar example of which 
is the electromagnetic field. In standard quantum mechanics, we're taught to take the 
classical degrees of freedom and promote them to operators acting on a Hilbert space. 
The rules for quantizing a field are no different. Thus the basic degrees of freedom in 
quantum field theory are operator valued functions of space and time. This means that 
we are dealing with an infinite number of degrees of freedom — at least one for every 
point in space. This infinity will come back to bite on several occasions. 

It will turn out that the possible interactions in quantum field theory are governed 
by a few basic principles: locality, symmetry and renormalization group flow (the 
decoupling of short distance phenomena from physics at larger scales). These ideas 
make QFT a very robust framework: given a set of fields there is very often an almost 
unique way to couple them together. 

What is Quantum Field Theory Good For? 

The answer is: almost everything. As I have stressed above, for any relativistic system 
it is a necessity. But it is also a very useful tool in non-relativistic systems with many 
particles. Quantum field theory has had a major impact in condensed matter, high- 
energy physics, cosmology, quantum gravity and pure mathematics. It is literally the 
language in which the laws of Nature are written. 

0.1 Units and Scales 

Nature presents us with three fundamental dimensionful constants; the speed of light c, 
Planck's constant (divided by 2n) K and Newton's constant G. They have dimensions 



LT 



1-1 



[h] 
[G] 



L 2 MT- X 
L 3 M -i T -2 



Throughout this course we will work with "natural" units, defined by 



c = h=l 



(0.1) 
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which allows us to express all dimensionful quantities in terms of a single scale which 
we choose to be mass or, equivalently, energy (since E = mc 2 has become E = m). 
The usual choice of energy unit is eV, the electron volt or, more often GeV = 10 9 eV" or 
TeV = 10 12 eV. To convert the unit of energy back to a unit of length or time, we need 
to insert the relevant powers of c and h. For example, the length scale A associated to 
a mass m is the Compton wavelength 



A 



h 

mc 



With this conversion factor, the electron mass m £ 
K = 2 x 10- 12 m. 



10 6 eV translates to a length scale 



Throughout this course we will refer to the dimension of a quantity, meaning the 
mass dimension. If X has dimensions of (mass) d we will write [X] = d. In particular, 
the surviving natural quantity G has dimensions [G] = —2 and defines a mass scale, 



G 



he 

p 



M 2 
p 



(0.2) 



where M p w 10 GeV is the Planck scale. It corresponds to a length l p « 10~ 6i cm. The 
Planck scale is thought to be the smallest length scale that makes sense: beyond this 
quantum gravity effects become important and it's no longer clear that the concept 
of spacetime makes sense. The largest length scale we can talk of is the size of the 
cosmo logical horizon, roughly 10 60 / p . 
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Universe 
• 20 billion light years 
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Earth 10 cm 



Cosmological 
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Atoms Nuclei 
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10- J eV 
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Figure 2: Energy and Distance Scales in the Universe 



Some useful scales in the universe are shown in the figure. This is a logarithmic plot, 
with energy increasing to the right and, correspondingly, length increasing to the left. 
The smallest and largest scales known are shown on the figure, together with other 
relevant energy scales. The standard model of particle physics is expected to hold up 



- 5 - 



to about the TeV. This is precisely the regime that is currently being probed by the 
Large Hadron Collider (LHC) at CERN. There is a general belief that the framework 
of quantum field theory will continue to hold to energy scales only slightly below the 
Planck scale — for example, there are experimental hints that the coupling constants 
of electromagnetism, and the weak and strong forces unify at around 10 18 GeV. 

For comparison, the rough masses of some elementary (and not so elementary) par- 
ticles are shown in the table, 



Particle 


Mass 


neutrinos 


~ 1(T 2 eV 


electron 


0.5 MeV 


Muon 


100 MeV 


Pions 


140 MeV 


Proton, Neutron 


1 GeV 


Tau 


2 GeV 


W,Z Bosons 


80-90 GeV 


Higgs Boson 


125 GeV 
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1. Classical Field Theory 



In this first section we will discuss various aspects of classical fields. We will cover only 
the bare minimum ground necessary before turning to the quantum theory, and will 
return to classical field theory at several later stages in the course when we need to 
introduce new ideas. 

1.1 The Dynamics of Fields 

A field is a quantity defined at every point of space and time (x,t). While classical 
particle mechanics deals with a finite number of generalized coordinates q a (t), indexed 
by a label a, in field theory we are interested in the dynamics of fields 

where both a and x are considered as labels. Thus we are dealing with a system with an 
infinite number of degrees of freedom — at least one for each point x in space. Notice 
that the concept of position has been relegated from a dynamical variable in particle 
mechanics to a mere label in field theory. 

An Example: The Electromagnetic Field 

The most familiar examples of fields from classical physics are the electric and magnetic 

— * — * 

fields, E(x,t) and B(x,t). Both of these fields are spatial 3-vectors. In a more sophis- 
ticated treatement of electromagnetism, we derive these two 3-vectors from a single 
4-component field A^(x, t) = (0, A) where /i — 0, 1, 2, 3 shows that this field is a vector 
in spacetime. The electric and magnetic fields are given by 

OA 

E = -V0- — and B = V x A (1.2) 

which ensure that two of Maxwell's equations, V • B — and dB/dt = — V x E, hold 
immediately as identities. 

The Lagrangian 

The dynamics of the field is governed by a Lagrangian which is a function of (f>(x,t), 
<f>(x,t) and V(f)(x,t). In all the systems we study in this course, the Lagrangian is of 
the form, 

L(t) = J d 3 x£(<j) a ,dM (1.3) 



-7- 



where the official name for C is the Lagrangian density, although everyone simply calls 
it the Lagrangian. The action is, 

S = J* dt J d 3 x C = J d A x C (1.4) 

Recall that in particle mechanics L depends on q and q, but not q. In field theory 
we similarly restrict to Lagrangians £ depending on and 0, and not 0. In principle, 
there's nothing to stop C depending on V0, V 2 0, V 3 0, etc. However, with an eye to 
later Lorentz invariance, we will only consider Lagrangians depending on V0 and not 
higher derivatives. Also we will not consider Lagrangians with explicit dependence on 
x M ; all such dependence only comes through and its derivatives. 

We can determine the equations of motion by the principle of least action. We vary 
the path, keeping the end points fixed and require SS = 0, 

dC _ dC 



5S 



d A x 
d 4 x 



+ 



dC 



d„ 



dC 



5(<9 At a ) 



dC 



1.5) 



The last term is a total derivative and vanishes for any 8<p a (x,t) that decays at spatial 
infinity and obeys 6<f) a (x,ti) = 6<fi a (x, t 2 ) = 0. Requiring 6S = for all such paths 
yields the Euler-Lagrange equations of motion for the fields a , 

dC \ dC 







M \d(d^(j) a ) J d(f) a 
1.1.1 An Example: The Klein-Gordon Equation 

Consider the Lagrangian for a real scalar field (f)(x,t), 



;i.6) 



2 '/ u iiY yJ v 



- Wcj) 2 



;i.7) 



¥ 2 



-(V0) 2 - W 



where we are using the Minkowski space metric 



Comparing (1.7) to the usual expression for the Lagrangian L = T — V, we identify the 
kinetic energy of the field as 



T 



d 3 x |0 2 



;i.9) 
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and the potential energy of the field as 

V = J d 3 x \{V(b) 2 + \m 2 (b 2 (1.10) 

The first term in this expression is called the gradient energy, while the phrase "poten- 
tial energy", or just "potential", is usually reserved for the last term. 

To determine the equations of motion arising from (1.7), we compute 

w=- m * and wr sw -- v ^ (L11) 

The Euler-Lagrange equation is then 

4>-V 2 (p + m 2 (f) = (1.12) 

which we can write in relativistic form as 

<9^0 + m 2 = (1.13) 

This is the Klein- Gordon Equation. The Laplacian in Minkowski space is sometimes 
denoted by □. In this notation, the Klein-Gordon equation reads □</> + m 2 <j) = 0. 

An obvious generalization of the Klein-Gordon equation comes from considering the 
Lagrangian with arbitrary potential V((b), 

C=\d^d^-V{ ( p) => 5^+^ = (1.14) 

1.1.2 Another Example: First Order Lagrangians 

We could also consider a Lagrangian that is linear in time derivatives, rather than 
quadratic. Take a complex scalar field ib whose dynamics is defined by the real La- 
grangian 

C= ^(^ip - ip*i/;) - VV>*- Vip-mip*ip (1.15) 

We can determine the equations of motion by treating ib and ip* as independent objects, 
so that 

OH i • OH i OjO 

—— = -ib — mil) and — — = — ib and — - , = — (1-16) 
dib* V y dip* ^ dVib* * v ; 

This gives us the equation of motion 

i^ = _V 2 ^ + m^ (1.17) 

This looks very much like the Schrodinger equation. Except it isn't! Or, at least, the 
interpretation of this equation is very different: the field ib is a classical field with none 
of the probability interpretation of the wavefunction. We'll come back to this point in 
Section 2.8. 
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The initial data required on a Cauchy surface differs for the two examples above. 
When C ~ (f> 2 , both <f> and must be specified to determine the future evolution; 
however when C ~ ?/>*■?/>, only ip and ^* are needed. 

1.1.3 A Final Example: Maxwell's Equations 

We may derive Maxwell's equations in the vacuum from the Lagrangian, 

C = -\ (d,A u ) (d»A v ) + \{d^f (1.18) 

Notice the funny minus signs! This is to ensure that the kinetic terms for A4 are positive 
using the Minkowski space metric (1.8), so C ~ \Aj. The La grangian (1.18) has no 
kinetic term for A . We will see the consequences of this in Section 6. To see that 
Maxwell's equations indeed follow from (1.18), we compute 

^- )= - d ^ + (W (1.19) 

from which we may derive the equations of motion, 

d " { d(d A ) ) = ~^ A " + d ^ d " AP) = - d ^ A " - d " Afl ) = ~ d ^ u (i- 20 ) 

where the field strength is defined by = d^A v — d u A^. You can check using (1.2) 
that this reproduces the remaining two Maxwell's equations in a vacuum: V • E — 
and dE/dt = V x B. Using the notation of the field strength, we may rewrite the 
Maxwell Lagrangian (up to an integration by parts) in the compact form 

C = -\ F^ (1.21) 

1.1.4 Locality, Locality, Locality 

In each of the examples above, the Lagrangian is local. This means that there are no 
terms in the Lagrangian coupling 4>(x, t) directly to <p(y, t) with x 7^ y . For example, 
there are no terms that look like 

d 3 xd 3 y <f>{x)<t){y) (1.22) 

A priori, there's no reason for this. After all, x is merely a label, and we're quite 
happy to couple other labels together (for example, the term d^A Q 8qA 3 in the Maxwell 
Lagrangian couples the \i = field to the \i = 3 field). But the closest we get for the 
x label is a coupling between <p(x) and (f)(x + Sx) through the gradient term (V</>) 2 . 
This property of locality is, as far as we know, a key feature of all theories of Nature. 
Indeed, one of the main reasons for introducing field theories in classical physics is to 
implement locality. In this course, we will only consider local Lagrangians. 
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1.2 Lorentz Invariance 



The laws of Nature are relativistic, and one of the main motivations to develop quantum 
field theory is to reconcile quantum mechanics with special relativity. To this end, we 
want to construct field theories in which space and time are placed on an equal footing 
and the theory is invariant under Lorentz transformations, 



x" — >■ {x'Y = A^x u 



where A M „ satisfies 



;i.23) 



;i.24) 



For example, a rotation by 9 about the x 3 -axis, and a boost by v < 1 along the x 1 -axis 
are respectively described by the Lorentz transformations 



(\ o o\ 

cos 9 — sin 9 
sin 9 cos 9 
\0 1/ 



and A^„ = 





7 


— 71! 





o\ 




— JV 


7 
















1 





V 











1/ 



(1.25) 



with 7 = 1/Vl — v 2 . The Lorentz transformations form a Lie group under matrix 
multiplication. You'll learn more about this in the "Symmetries and Particle Physics" 
course. 

The Lorentz transformations have a representation on the fields. The simplest ex- 
ample is the scalar field which, under the Lorentz transformation x — > Ax, transforms 

as 



<j>{x) ->• <j>'{x) = (PiA^x) 



1.26) 



The inverse A -1 appears in the argument because we are dealing with an active trans- 
formation in which the field is truly shifted. To see why this means that the inverse 
appears, it will suffice to consider a non-relativistic example such as a temperature field. 
Suppose we start with an initial field <j){x) which has a hotspot at, say, x = (1,0,0). 
After a rotation x — > Rx about the z-axis, the new field 4>'(x) will have the hotspot at 
x — (0, 1, 0). If we want to express <p'(x) in terms of the old field 0, we need to place 
ourselves at x — (0, 1,0) and ask what the old field looked like where we've come from 
at -R _1 (0, 1,0) = (1,0,0). This R^ 1 is the origin of the inverse transformation. (If we 
were instead dealing with a passive transformation in which we relabel our choice of 
coordinates, we would have instead (f>(x) — > 4>'{x) = (fi(Ax)). 
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The definition of a Lorentz invariant theory is that if <fi{x) solves the equations of 
motion then cf)(hr 1 x) also solves the equations of motion. We can ensure that this 
property holds by requiring that the action is Lorentz invariant. Let's look at our 
examples: 

Example 1: The Klein-Gordon Equation 

For a real scalar field we have <fi{x) — > <fi'(x) = cf)(hr 1 x). The derivative of the scalar 
field transforms as a vector, meaning 

-> {^ l )\{dA){y) 

where y = A _1 x. This means that the derivative terms in the Lagrangian density 
transform as 

c denv (x) = d^{x)d v 4>{x)^ v — ► (A-'y^dMy) (A-'TAd^iy) if v 

= {d p <j>){y) (S^)(y)iT 

= £deriv(y) (1-27) 

The potential terms transform in the same way, with <j) 2 (x) — > <p 2 (y)- Putting this all 
together, we find that the action is indeed invariant under Lorentz transformations, 

S = J d 4 x C(x) — >• / d 4 x C{y) = J d A y C{y) = S (1.28) 

where, in the last step, we need the fact that we don't pick up a Jacobian factor when 
we change integration variables from J d 4 x to J d A y. This follows because detA = 1. 
(At least for Lorentz transformation connected to the identity which, for now, is all we 
deal with). 

Example 2: First Order Dynamics 

In the first-order Lagrangian (1.15), space and time are not on the same footing. (C 
is linear in time derivatives, but quadratic in spatial derivatives). The theory is not 
Lorentz invariant. 

In practice, it's easy to see if the action is Lorentz invariant: just make sure all 
the Lorentz indices \i = 0,1,2,3 are contracted with Lorentz invariant objects, such 
as the metric rj^. Other Lorentz invariant objects you can use include the totally 
antisymmetric tensor e^ upa and the matrices 7^ that we will introduce when we come 
to discuss spinors in Section 4. 
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Example 3: Maxwell's Equations 

Under a Lorentz transformation A^(x) — > A u (A~ 1 x). You can check that Maxwell's 
Lagrangian (1.21) is indeed invariant. Of course, historically electrodynamics was the 
first Lorentz invariant theory to be discovered: it was found even before the concept of 
Lorentz invariance. 

1.3 Symmetries 

The role of symmetries in field theory is possibly even more important than in particle 
mechanics. There are Lorentz symmetries, internal symmetries, gauge symmetries, 
supersymmetries.... We start here by recasting Noether's theorem in a field theoretic 
framework. 

1.3.1 Noether's Theorem 

Every continuous symmetry of the Lagrangian gives rise to a conserved current j^(x) 
such that the equations of motion imply 

<V M = (1.29) 

or, in other words, dj°/dt + V • j = 0. 

A Comment: A conserved current implies a conserved charge Q, defined as 

Q = f d 3 x j° (1.30) 

which one can immediately see by taking the time derivative, 

d Q f ,3 , « 



d 6 x = - / d x V ■ j = (1.31) 
dt J R 3 at y R 3 

assuming that j — ^ sufficiently quickly as |x| — > oo. However, the existence of a 
current is a much stronger statement than the existence of a conserved charge because 
it implies that charge is conserved locally. To see this, we can define the charge in a 
finite volume V, 

Q v = j d 3 xj° (1.32) 
Jv 

Repeating the analysis above, we find that 

dQ v 



dt 



d 6 xV-j = - / j-dS (1.33) 

J A 
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where A is the area bounding V and we have used Stokes' theorem. This equation 
means that any charge leaving V must be accounted for by a flow of the current 3- 
vector j out of the volume. This kind of local conservation of charge holds in any local 
field theory. 

Proof of Noether's Theorem: We'll prove the theorem by working infinitesimally. 
We may always do this if we have a continuous symmetry. We say that the transfor- 
mation 

5Mx) = X a {4>) (1.34) 

is a symmetry if the Lagrangian changes by a total derivative, 

5C = <9 M i^ (1.35) 

for some set of functions F M (0). To derive Noether's theorem, we first consider making 
an arbitrary transformation of the fields 5<p a . Then 

d(j) a o[d^a) 



dC dC 

^ O n 



-7' 



When the equations of motion are satisfied, the term in square brackets vanishes. So 
we're left with 

i£ = a '(TO^) (L37) 

But for the symmetry transformation 5<p a = X a ((p), we have by definition 5C = d^F^ 1 . 
Equating this expression with (1.37) gives us the result 

dC 

d^ = with j* = X a (<fi) - F»(4>) (1.38) 

1.3.2 An Example: Translations and the Energy-Momentum Tensor 

Recall that in classical particle mechanics, invariance under spatial translations gives 
rise to the conservation of momentum, while invariance under time translations is 
responsible for the conservation of energy. We will now see something similar in field 
theories. Consider the infinitesimal translation 

x v ^x v -e v <j> a {x) -> <f> a {x) + e v d v <j> a (x) (1.39) 
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(where the sign in the field transformation is plus, instead of minus, because we're doing 
an active, as opposed to passive, transformation). Similarly, once we substitute a spe- 
cific field configuration 4>(x) into the Lagrangian, the Lagrangian itself also transforms 

as 

C(x) ->• £{x) + e v d u C(x) (1.40) 

Since the change in the Lagrangian is a total derivative, we may invoke Noether's 
theorem which gives us four conserved currents (j 74 )^, one for each of the translations 
e u with v = 0,1,2,3, 

dC 

T p v is called the energy-momentum tensor. It satisfies 

d^TI = (1.42) 
The four conserved quantities are given by 

E = [ d 3 x T 00 and P i = I d 3 x T 0i (1.43) 



where E is the total energy of the field configuration, while P l is the total momentum 
of the field configuration. 

An Example of the Energy-Momentum Tensor 

Consider the simplest scalar field theory with Lagrangian (1.7). From the above dis- 
cussion, we can compute 

T» u = d^<f)d u <j)-^ u £ (1.44) 

One can verify using the equation of motion for <fi that this expression indeed satisfies 
d^T^ = 0. For this example, the conserved energy and momentum are given by 

E = J d 3 x \ft + \{V4>f + \m 2 <\) 2 (1.45) 
d 3 x0<9> (1.46) 



Notice that for this example, T^ u came out symmetric, so that T^ u = T utl . This 
won't always be the case. Nevertheless, there is typically a way to massage the energy 
momentum tensor of any theory into a symmetric form by adding an extra term 

where T ptiu is some function of the fields that is anti-symmetric in the first two indices so 
pp^f — —Y ppv . This guarantees that d^dfX 9 ^ = so that the new energy- momentum 
tensor is also a conserved current. 
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A Cute Trick 



One reason that you may want a symmetric energy-momentum tensor is to make con- 
tact with general relativity: such an object sits on the right-hand side of Einstein's 
field equations. In fact this observation provides a quick and easy way to determine a 
symmetric energy-momentum tensor. Firstly consider coupling the theory to a curved 
background spacetime, introducing an arbitrary metric g^x) in place of rj^, and re- 
placing the kinetic terms with suitable covariant derivatives using "minimal coupling" . 
Then a symmetric energy momentum tensor in the flat space theory is given by 

e- = - -L d ^ c) (i.48) 

V=9 dg, u giiv=Vfiu 

It should be noted however that this trick requires a little more care when working 
with spinors. 

1.3.3 Another Example: Lorentz Transformations and Angular Momentum 

In classical particle mechanics, rotational invariance gave rise to conservation of angular 
momentum. What is the analogy in field theory? Moreover, we now have further 
Lorentz transformations, namely boosts. What conserved quantity do they correspond 
to? To answer these questions, we first need the infinitesimal form of the Lorentz 
transfer mat ions 

A"„ = P v + < (1.49) 

where is infinitesimal. The condition (1.24) for A to be a Lorentz transformation 
becomes 

=^ + oj v " = (1.50) 

So the infinitesimal form lo^ u of the Lorentz transformation must be an anti-symmetric 
matrix. As a check, the number of different 4x4 anti-symmetric matrices is 4 x 3/2 = 6, 
which agrees with the number of different Lorentz transformations (3 rotations + 3 
boosts). Now the transformation on a scalar field is given by 

4>{x) ->> <p\x) = 4>(AT l x) 

= (j)(x^ - oo^x u ) 

= (f)(x ti )-u^x v d lx (f)(x) (1.51) 
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from which we see that 

5(f) = -u^x u d^(f> (1.52) 

By the same argument, the Lagrangian density transforms as 

5C = -u^d^C = -d^yC) (1.53) 

where the last equality follows because = due to anti-symmetry. Once again, 
the Lagrangian changes by a total derivative so we may apply Noether's theorem (now 
with F p = —u^x^C) to find the conserved current 



dC 



= -co p 



= -uP v T>" (1.54) 



Unlike in the previous example, I've left the infinitesimal choice of in the expression 
for this current. But really, we should strip it out to give six different currents, i.e. one 
for each choice of u p u . We can write them as 

(J»y> = x PT» a - xT w (1.55) 

which satisfy d f j,(J' tl ) pcr = and give rise to 6 conserved charges. For p, a — 1,2,3, 
the Lorentz transformation is a rotation and the three conserved charges give the total 
angular momentum of the field. 

Q ij = J d 3 x (x'T ' - x j T 0i ) (1.56) 

But what about the boosts? In this case, the conserved charges are 

Q°* = f d 3 x (x°T 0i - x'T 00 ) (1.57) 



The fact that these are conserved tells us that 

= ^ = [d 3 xT 0i + t [d'x^-i- I d 3 x x'T 00 



dt J J dt dt 

= P l + t^r- - 4 [d 3 x x ! T 00 (1.58) 
dt dt J y ' 

But we know that P l is conserved, so dP l /dt = 0, leaving us with the following conse- 
quence of invariance under boosts: 

d 

di 

This is the statement that the center of energy of the field travels with a constant 
velocity. It's kind of like a field theoretic version of Newton's first law but, rather 
surprisingly, appearing here as a conservation law. 



d 3 x x l T 00 = constant (1.59) 
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1.3.4 Internal Symmetries 

The above two examples involved transformations of spacetime, as well as transforma- 
tions of the field. An internal symmetry is one that only involves a transformation of 
the fields and acts the same at every point in spacetime. The simplest example occurs 
for a complex scalar field tp(x) = (4>i(x) + i(j> 2 (x)) / 'y/2. We can build a real Lagrangian 
by 

C = d^*d^-V{\^\ 2 ) (1.60) 

where the potential is a general polynomial in \ip\ 2 = vfj*if). To find the equations of 
motion, we could expand ip in terms of 4>i an d 02 and work as before. However, it's 
easier (and equivalent) to treat ip and ip* as independent variables and vary the action 
with respect to both of them. For example, varying with respect to -0* leads to the 
equation of motion 

The Lagrangian has a continuous symmetry which rotates 0i and 2 or, equivalently, 
rotates the phase of -0: 

■0 ->■ e ta ip or 5ip = icetp (1-62) 

where the latter equation holds with a infinitesimal. The Lagrangian remains invariant 
under this change: 8C = 0. The associated conserved current is 

j» = i(d^*)^ - (1.63) 

We will later see that the conserved charges arising from currents of this type have 
the interpretation of electric charge or particle number (for example, baryon or lepton 
number) . 

Non-Abelian Internal Symmetries 

Consider a theory involving N scalar fields a , all with the same mass and the La- 
grangian 

N N / N \ 2 

£ = 2 E - 2 £ m2 ^ ~ 9 ( E # ( L64 ) 

a=l a=l \a=l / 

In this case the Lagrangian is invariant under the non-Abelian symmetry group G = 
SO(N). (Actually O(N) in this case). One can construct theories from complex fields 
in a similar manner that are invariant under an SU (N) symmetry group. Non-Abelian 
symmetries of this type are often referred to as global symmetries to distinguish them 
from the "local gauge" symmetries that you will meet later. Isospin is an example of 
such a symmetry, albeit realized only approximately in Nature. 
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Another Cute Trick 

There is a quick method to determine the conserved current associated to an internal 
symmetry 5<f) = a<p for which the Lagrangian is invariant. Here, a is a constant real 
number. (We may generalize the discussion easily to a non-Abelian internal symmetry 
for which a becomes a matrix). Now consider performing the transformation but where 
a depends on spacetime: a = a(x). The action is no longer invariant. However, the 
change must be of the form 

SC = (<V0 /^(0) (1.65) 
since we know that 5C = when a is constant. The change in the action is therefore 

5S= [ d 4 x5C = - [ d 4 x a{x) d.h" (1.66) 



which means that when the equations of motion are satisfied (so 5S = for all varia- 
tions, including 5(f) = cx(x)<p) we have 

dptf = (1.67) 

We see that we can identify the function = j M as the conserved current. This way 
of viewing things emphasizes that it is the derivative terms, not the potential terms, 
in the action that contribute to the current. (The potential terms are invariant even 
when a = a(x)). 

1.4 The Hamiltonian Formalism 

The link between the Lagrangian formalism and the quantum theory goes via the path 
integral. In this course we will not discuss path integral methods, and focus instead 
on canonical quantization. For this we need the Hamiltonian formalism of field theory. 
We start by defining the momentum 7i a (x) conjugate to 4> a {x), 

n a (x) = |£ (1.68) 

The conjugate momentum ir a (x) is a function of x, just like the field <p a (x) itself. It 
is not to be confused with the total momentum P l defined in (1.43) which is a single 
number characterizing the whole field configuration. The Hamiltonian density is given 
by 

U = 7r a (x)0 a (x) - C(x) (1.69) 

where, as in classical mechanics, we eliminate (f> a (x) in favour of 7r a (x) everywhere in 
Ti. The Hamiltonian is then simply 

H= [ d 3 xn (1.70) 
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An Example: A Real Scalar Field 

For the Lagrangian 



C = \tf - |(V0) 2 - V(<f>) (1.71) 
the momentum is given by it = 0, which gives us the Hamiltonian, 

H — j d 3 x Ivr 2 + 1(V0) 2 + V{4>) (1.72) 

Notice that the Hamiltonian agrees with the definition of the total energy (1.45) that 
we get from applying Noether's theorem for time translation invariance. 

In the Lagrangian formalism, Lorentz invariance is clear for all to see since the action 
is invariant under Lorentz transformations. In contrast, the Hamiltonian formalism is 
not manifestly Lorentz invariant: we have picked a preferred time. For example, the 
equations of motion for 4>(x) = (p(x, t) arise from Hamilton's equations, 

^ t) = M¥J) and ^ t) = -dWJ) (L73) 

which, unlike the Euler-Lagrange equations (1.6), do not look Lorentz invariant. Nev- 
ertheless, even though the Hamiltonian framework doesn't look Lorentz invariant, the 
physics must remain unchanged. If we start from a relativistic theory, all final answers 
must be Lorentz invariant even if it's not manifest at intermediate steps. We will pause 
at several points along the quantum route to check that this is indeed the case. 
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2. Free Fields 



"The career of a young theoretical physicist consists of treating the harmonic 

oscillator in ever-increasing levels of abstraction." n . , „ , 

bianey Coleman 

2.1 Canonical Quantization 

In quantum mechanics, canonical quantization is a recipe that takes us from the Hamil- 
tonian formalism of classical dynamics to the quantum theory. The recipe tells us to 
take the generalized coordinates q a and their conjugate momenta p a and promote them 
to operators. The Poisson bracket structure of classical mechanics morphs into the 
structure of commutation relations between operators, so that, in units with h — 1, 



In field theory we do the same, now for the field <f) a (x) and its momentum conjugate 
n b (x). Thus a quantum field is an operator valued function of space obeying the com- 
mutation relations 



Note that we've lost all track of Lorentz invariance since we have separated space x 
and time t. We are working in the Schrodinger picture so that the operators (f> a (x) and 
ir a (x) do not depend on time at all — only on space. All time dependence sits in the 
states \if)) which evolve by the usual Schrodinger equation 



We aren't doing anything different from usual quantum mechanics; we're merely apply- 
ing the old formalism to fields. Be warned however that the notation for the state 
is deceptively simple: if you were to write the wavefunction in quantum field theory, it 
would be a functional, that is a function of every possible configuration of the field <fi. 

The typical information we want to know about a quantum theory is the spectrum of 
the Hamiltonian H. In quantum field theories, this is usually very hard. One reason for 
this is that we have an infinite number of degrees of freedom — at least one for every 
point x in space. However, for certain theories — known as free theories — we can find 
a way to write the dynamics such that each degree of freedom evolves independently 



[qa,g b ] = [p a ,P b ] = o 



(2.1) 



[0 a (f),0 fe (y)] = K(f),7r fe (f)] = O 
[Mx),rr b (y)]= l 5 (s \x-y)5 b a 



(2.2) 




(2.3) 



- 21 - 



from all the others. Free field theories typically have Lagrangians which are quadratic 
in the fields, so that the equations of motion are linear. For example, the simplest 
relativistic free theory is the classical Klein-Gordon (KG) equation for a real scalar 
field 4>{x, t), 

d^cj) + m 2 (f) = (2.4) 

To exhibit the coordinates in which the degrees of freedom decouple from each other, 
we need only take the Fourier transform, 

<K*,t) = I -^^mt) (2.5) 



Then 0(p, t) satisfies 



§- 2 +(p 2 + m 2 )) <P(p,t) = (2.6) 



Thus, for each value of p, <p(p, t) solves the equation of a harmonic oscillator vibrating 
at frequency 



= + v / P 2 + m 2 (2.7) 



We learn that the most general solution to the KG equation is a linear superposition of 
simple harmonic oscillators, each vibrating at a different frequency with a different am- 
plitude. To quantize (f>(x, t) we must simply quantize this infinite number of harmonic 
oscillators. Let's recall how to do this. 

2.1.1 The Simple Harmonic Oscillator 

Consider the quantum mechanical Hamiltonian 

H = \p 2 + \u 2 q 2 (2.8) 

with the canonical commutation relations [q,p] = %■ To find the spectrum we define 
the creation and annihilation operators (also known as raising/lowering operators, or 
sometimes ladder operators) 

u> i t Iuj i 

2 q+ 7^ p ' a y2 q ~v^ p (2 - 9) 



which can be easily inverted to give 

q = 



|V a + at) , p=-iJ-{a-a)) (2.10) 
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Substituting into the above expressions we find 

[a, a 1 "] = 1 

while the Hamiltonian is given by 

H = ^u(aa^ + a^a) 
= oo(a)a + |) 



(2.11) 



(2.12) 



One can easily confirm that the commutators between the Hamiltonian and the creation 
and annihilation operators are given by 



[H, a)] = ua) and [H, a] = —coa 



(2.13) 



These relations ensure that a and a' take us between energy eigenstates. Let \E) be 
an eigenstate with energy E, so that H\E) = E\E). Then we can construct more 
eigenstates by acting with a and a) , 



Ha) \E) = (E + uo)a) \E) , Ha \E) — (E — u)a \E) 
So we find that the system has a ladder of states with energies 

...,E-uj,E,E + u,E + 2uj,... 



(2.14) 



(2.15) 



If the energy is bounded below, there must be a ground state |0) which satisfies a |0) = 0. 
This has ground state energy (also known as zero point energy), 

H\0) = lu\0) (2.16) 

Excited states then arise from repeated application of a\ 

|n) = (a t ) n |0) with H \n) = (n + \)u \n) (2.17) 

where I've ignored the normalization of these states so, (n| n) ^ 1. 

2.2 The Free Scalar Field 

We now apply the quantization of the harmonic oscillator to the free scalar field. We 
write and n as a linear sum of an infinite number of creation and annihilation oper- 
ators at and ap, indexed by the 3-momentum p, 



d 3 p 



(f)(x) = 



(2tt)3 ^ 
d 3 p 

Wf 



a-e ip ' x + ale 



t „-ip-S 




(2.18) 
(2.19) 
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Claim: The commutation relations for and ir are equivalent to the following com- 
mutation relations for a$ and at 

[4>{x), (f>{y)) = [tt(x), ir{y)) = ^ [a p , a ? ) = [at, at] =0 2Q 
[0(f) )7 r(y)]=^ 3 )(f-y) W [a ;? ,at] = (2vr) 3 o( 3 )(p-g) 



Proof: We'll show this just one way. Assume that [a-, at] = (27r) 3 5^ 3 ^(p — q). Then 



[X ,7T 



d 3 p d 3 g 



2tt 




d 3 p (— i) 



(2tt) 3 2 
zo" (3) (x - y) 



(2.21) 

□ 



The Hamiltonian 

Let's now compute the Hamiltonian in terms of and at. We have 



H 



d 3 x 7T 2 + (V0) 2 + m 2 



2 1.2 



1 /" d 3 x d 3 p d 3 q 

2 J (2tt)6 
1 



(a^e^ - ate-^)(a ? e i9 "- 2 - « f 



aue — ) 



m 



1 f d 3 p 1 



(age?* 3 + ate-* 3 )( aif e* 3 + ate"**) 



4 J (2vr) 3 w p - 



(-w| + p 2 + m 2 )(a^a_p>+ atat ? ) + (w| + p 2 + m 2 )(a^al + ala f ) 



where in the second line we've used the expressions for and 7r given in (2.18) and 
(2.19); to get to the third line we've integrated over d 3 x to get delta-functions 5^{p±q) 
which, in turn, allow us to perform the d 3 q integral. Now using the expression for the 
frequency oJi = p 2 + m 2 , the first term vanishes and we're left with 



H 



d 3 p 



2 J (2tt) 3 Up 
d 3 p 



°p4+ a l a p 



(27T 



4%+i( 27r ) 3j(3) (o) 



(2.22) 
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Hmmmm. We've found a delta-function, evaluated at zero where it has its infinite 
spike. Moreover, the integral over uip diverges at large p. What to do? Let's start by 
looking at the ground state where this infinity first becomes apparent. 

2.3 The Vacuum 

Following our procedure for the harmonic oscillator, let's define the vacuum |0) by 
insisting that it is annihilated by all a-, 

a # |0)=0 Vp (2.23) 

With this definition, the energy Eq of the ground state comes from the second term in 
(2.22), 



H\0) =Eo\0) 



0) = oo |0) (2.24) 



The subject of quantum field theory is rife with infinities. Each tells us something 
important, usually that we're doing something wrong, or asking the wrong question. 
Let's take some time to explore where this infinity comes from and how we should deal 
with it. 

In fact there are two different oo's lurking in the expression (2.24). The first arises 
because space is infinitely large. (Infinities of this type are often referred to as infra-red 
divergences although in this case the oo is so simple that it barely deserves this name). 
To extract out this infinity, let's consider putting the theory in a box with sides of 
length L. We impose periodic boundary conditions on the field. Then, taking the limit 
where L — > oo, we get 

f L/2 rL/2 

(2tt) 3 5 (3) (0) = lim / d 3 x e^L = lim / d 3 x = V (2.25) 

L^oo J_ L/2 'P-U L^oo J_ L/2 

where V is the volume of the box. So the 5(0) divergence arises because we're computing 
the total energy, rather than the energy density 8q. To find Sq we can simply divide by 
the volume, 

So= V=J(2^2 UJ v (2 ' 26) 
which is still infinite. We recognize it as the sum of ground state energies for each 
harmonic oscillator. But Sq — > oo due to the |p| — > oo limit of the integral. This is 
a high frequency — or short distance — infinity known as an ultra-violet divergence. 
This divergence arises because of our hubris. We've assumed that our theory is valid 
to arbitrarily short distance scales, corresponding to arbitrarily high energies. This is 
clearly absurd. The integral should be cut-off at high momentum in order to reflect 
the fact that our theory is likely to break down in some way 
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We can deal with the infinity in (2.24) in a more practical way. In physics we're 
only interested in energy differences. There's no way to measure Eq directly, so we can 
simply redefine the Hamiltonian by subtracting off this infinity, 



H = I 7^3 ^4 a P ( 2 - 27 ) 



d 3 p 

so that, with this new definition, H |0) = 0. In fact, the difference between this Hamilto- 
nian and the previous one is merely an ordering ambiguity in moving from the classical 
theory to the quantum theory. For example, if we defined the Hamiltonian of the har- 
monic oscillator to be H = (l/2)(uq — ip)(ojq + ip), which is classically the same as our 
original choice, then upon quantization it would naturally give H = ua^a as in (2.27). 
This type of ordering ambiguity arises a lot in field theories. We'll come across a number 
of ways of dealing with it. The method that we've used above is called normal ordering. 

Definition: We write the normal ordered string of operators <pi(xi) . . . <f) n (x n ) as 

:0i(£i)...0„(x„): (2.28) 

It is defined to be the usual product with all annihilation operators a-, placed to the 
right. So, for the Hamiltonian, we could write (2.27) as 

:H:= Jwf u ^ (2 - 29) 

In the remainder of this section, we will normal order all operators in this manner. 
2.3.1 The Cosmological Constant 

Above I wrote "there's no way to measure Eq directly". There is a BIG caveat here: 
gravity is supposed to see everything! The sum of all the zero point energies should 
contribute to the stress-energy tensor that appears on the right-hand side of Einstein's 
equations. We expect them to appear as a cosmological constant A = Eq/V, 

R^u - \g^u = SttGT^ + A 9lMU (2.30) 

Current observation suggests that 70% of the energy density in the universe has the 
properties of a cosmological constant with A ~ (10~ 3 eV) 4 . This is much smaller than 
other scales in particle physics. In particular, the Standard Model is valid at least up 
to 10 12 eV. Why don't the zero point energies of these fields contribute to A? Or, if 
they do, what cancels them to such high accuracy? This is the cosmological constant 
problem. No one knows the answer! 
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2.3.2 The Casimir Effect 



"I mentioned my results to Niels Bohr, during a walk. That is nice, he 
said, that is something new... and he mumbled something about zero-point 

energy. Hendrik Casimir 

Using the normal ordering prescription we can happily set E = 0, while chanting 
the mantra that only energy differences can be measured. But we should be careful, for 
there is a situation where differences in the energy of vacuum fluctuations themselves 
can be measured. 

To regulate the infra-red divergences, we'll make the x 1 direction periodic, with size 
L, and impose periodic boundary conditions such that 



(x + Ln) 



(2.31) 



with n = (1,0,0). We'll leave y and z alone, but remember 
that we should compute all physical quantities per unit area 
A. We insert two reflecting plates, separated by a distance 
(I < L in the x 1 direction. The plates are such that they 
impose (p(x) = at the position of the plates. The presence of 
these plates affects the Fourier decomposition of the field and, 
in particular, means that the momentum of the field inside the 
plates is quantized as 



V 



nit 



( d 



,Py,P* 



nez- 



(2.32) 



Figure 3: 



For a massless scalar field, the ground state energy between the plates is 
E{d) 



E 



d Py d Pz lJ ( rmy +pl+pl 



(2.33) 



A (2vr) 2 2 V V d J 

71=1 

while the energy outside the plates is E(L — d). The total energy is therefore 

E = E(d) + E(L - d) (2.34) 

which - at least naively - depends on d. If this naive guess is true, it would mean 
that there is a force on the plates due to the fluctuations of the vacuum. This is the 
Casimir force, first predicted in 1948 and observed 10 years later. In the real world, the 
effect is due to the vacuum fluctuations of the electromagnetic field, with the boundary 
conditions imposed by conducting plates. Here we model this effect with a scalar. 
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But there's a problem. E is infinite! What to do? The problem comes from the 
arbitrarily high momentum modes. We could regulate this in a number of different 
ways. Physically one could argue that any real plate cannot reflect waves of arbitrarily 
high frequency: at some point, things begin to leak. Mathematically, we want to find 
a way to neglect modes of momentum p ^> a" 1 for some distance scale a <C d, known 
as the ultra-violet (UV) cut-off. One way to do this is to change the integral (2.33) to, 

*f = £f^ (7(f) 2 + PS+P?) e-^W^, (2.35) 

which has the property that as a — > 0, we regain the full, infinite, expression (2.33). 
However (2.35) is finite, and gives us something we can easily work with. Of course, 
we made it finite in a rather ad-hoc manner and we better make sure that any physical 
quantity we calculate doesn't depend on the UV cut-off a, otherwise it's not something 
we can really trust. 

The integral (2.35) is do-able, but a little complicated. It's a lot simpler if we look 
at the problem in d = 1 + 1 dimensions, rather than d = 3 + 1 dimensions. We'll find 
that all the same physics is at play. Now the energy is given by 

oo 

7T 



£i + iW = ^E n ( 2 - 36 ) 

71=1 

We now regulate this sum by introducing the UV cutoff a introduced above. This 
renders the expression finite, allowing us to start manipulating it thus, 

oo 

E 1+1 (d) -^YdX. ne~ a ™l d 

n=l 

2 da ^ 

n=l 

1 d 1 



2 da 1 - e~ a ^/ d 
ix e aw/d 



2 



2d (e an / d - 1) 

+ 0{a 2 ) (2.37) 



2na 2 24d 

where, in the last line, we've used the fact that a d. We can now compute the full 
energy, 

E 1+1 = E»M + E M (L - d) = ^-^n + T L\ + OM (2.38) 
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This is still infinite in the limit a — > 0, which is to be expected. However, the force is 
given by 



9E 1+1 7T 



dd 24d 2 



+ ... (2.39) 



where the . . . include terms of size d/L and a/d. The key point is that as we remove 
both the regulators, and take a — > and L — > oo, the force between the plates remains 
finite. This is the Casimir force 2 . 

If we ploughed through the analogous calculation in d — 3 + 1 dimensions, and 
performed the integral (2.35), we would find the result 

I^ = ^l_ (2 40) 

A dd 480d 4 1 ' ; 

The true Casimir force is twice as large as this, due to the two polarization states of 
the photon. 

2.4 Particles 

Having dealt with the vacuum, we can now turn to the excitations of the field. It's 
easy to verify that 



H, at] = LOpat and [H, a ? ] = -upa^ ( 2 -41) 



which means that, just as for the harmonic oscillator, we can construct energy eigen- 
states by acting on the vacuum |0) with at. Let 

[p) = 4|0> (2.42) 

This state has energy 

H \p) = ujp \p) with ujl = p 2 + m 2 (2.43) 

But we recognize this as the relativistic dispersion relation for a particle of mass m and 
3-momentum p, 

El = p 2 + m 2 (2.44) 



2 Thc number 24 that appears in the denominator of the one-dimensional Casimir force plays a more 
famous role in string theory: the same calculation in that context is the reason the bosonic string lives 
in 26 = 24 + 2 spacetimc dimensions. (The +2 comes from the fact the string itself is extended in one 
space and one time dimension). You will need to attend next term's "String Theory" course to see 
what on earth this has to do with the Casimir force. 
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We interpret the state \p) as the momentum eigenstate of a single particle of mass m. 
To stress this, from now on we'll write Ep everywhere instead of up. Let's check this 
particle interpretation by studying the other quantum numbers of \p). We may take 
the classical total momentum P given in (1.46) and turn it into an operator. After 
normal ordering, it becomes 

P = -jd 3 xnV<P = j^- payoff (2.45) 

Acting on our state \p) with P, we learn that it is indeed an eigenstate, 

P\P) =P\P) (2.46) 

telling us that the state \p) has momentum p. Another property of \p) that we can 
study is its angular momentum. Once again, we may take the classical expression for 
the total angular momentum of the field (1.55) and turn it into an operator, 

It's not hard to show that acting on the one-particle state with zero momentum, 
J 1 \p = 0) = 0, which we interpret as telling us that the particle carries no internal 
angular momentum. In other words, quantizing a scalar field gives rise to a spin 
particle. 

Multi-Particle States, Bosonic Statistics and Fock Space 

We can create multi-particle states by acting multiple times with a^'s. We interpret 
the state in which n cr's act on the vacuum as an n-particle state, 

\p 1 ,...,p n )=^ i ...al |0> (2.48) 

Because all the a''s commute among themselves, the state is symmetric under exchange 
of any two particles. For example, 

\p,q) = \q,p) (2.49) 
This means that the particles are bosons. 

The full Hilbert space of our theory is spanned by acting on the vacuum with all 
possible combinations of a''s, 

|o> , 4io> , 44io> , 444 io>... (2.50) 
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This space is known as a Fock space. The Fock space is simply the sum of the n-particle 
Hilbert spaces, for all n > 0. There is a useful operator which counts the number of 
particles in a given state in the Fock space. It is called the number operator N 



and satisfies N \p ly . . . ,p n ) = n \pi, . . . ,p n ). The number operator commutes with the 
Hamiltonian, [N, H] = 0, ensuring that particle number is conserved. This means that 
we can place ourselves in the n-particle sector, and stay there. This is a property of 
free theories, but will no longer be true when we consider interactions: interactions 
create and destroy particles, taking us between the different sectors in the Fock space. 

Operator Valued Distributions 

Although we're referring to the states \p) as "particles" , they're not localized in space 
in any way — they are momentum eigenstates. Recall that in quantum mechanics the 
position and momentum eigenstates are not good elements of the Hilbert space since 
they are not normalizable (they normalize to delta-functions). Similarly, in quantum 
field theory neither the operators (f>(x), nor a- are good operators acting on the Fock 
space. This is because they don't produce normalizable states. For example, 



(0|^4|0) = (p>-) = (27r) 3 5(0) and (0| <f>(x) 4>{x) |0> = <£| f) = 5(0) (2.52) 



They are operator valued distributions, rather than functions. This means that al- 
though 4>(x) has a well defined vacuum expectation value, (0| <p(x) |0) = 0, the fluc- 
tuations of the operator at a fixed point are infinite, (0\<f)(x)<j)(x) |0) = oo. We can 
construct well defined operators by smearing these distributions over space. For exam- 
ple, we can create a wavepacket 



which is partially localized in both position and momentum space. (A typical state 
might be described by the Gaussian f(p)= exp(— p 2 /2m 2 )). 

2.4.1 Relativistic Normalization 

We have defined the vacuum |0) which we normalize as (0 1 0) = 1. The one-particle 
states \p) = at|0) then satisfy 




(2.51) 




(2.53) 



(p\q) = W^\p-q) 



(2.54) 
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But is this Lorentz invariant? It's not obvious because we only have 3- vectors. What 
could go wrong? Suppose we have a Lorentz transformation 

p» ( p y = A^p v (2.55) 

such that the 3- vector transforms as p — > p' . In the quantum theory, it would be 
preferable if the two states are related by a unitary transformation, 

\p) \p') = U(A) \p) (2.56) 

This would mean that the normalizations of \p) and \p') are the same whenever p and 
p' are related by a Lorentz transformation. But we haven't been at all careful with 
normalizations. In general, we could get 

\p)^HP,P') W) (2-57) 

for some unknown function X(p,p'). How do we figure this out? The trick is to look at 
an object which we know is Lorentz invariant. One such object is the identity operator 
on one-particle states (which is really the projection operator onto one-particle states). 
With the normalization (2.54) we know this is given by 

1 = Iwf lPl {pl (2 ' 58) 

This operator is Lorentz invariant, but it consists of two terms: the measure J d 3 p and 
the projector \p){p\- Are these individually Lorentz invariant? In fact the answer is no. 



Claim The Lorentz invariant measure is, 



d 3 p 
2Eft 



(2.59) 



Proof: J d A p is obviously Lorentz invariant. And the relativistic dispersion relation 
for a massive particle, 



PpP^ = m 2 



Po 



E = P 2 + ™ 2 



(2.60) 



is also Lorentz invariant. Solving for po, there are two branches of solutions: po = ±Ep. 
But the choice of branch is another Lorentz invariant concept. So piecing everything 
together, the following combination must be Lorentz invariant, 



J d 4 p 5(pl — p 2 — m 2 
which completes the proof. 





f d 3 p 


Po>0 " 


1 2p~ 



Po=Ep 



(2.61) 
□ 
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From this result we can figure out everything else. For example, the Lorentz invariant 
5-function for 3-vectors is 



which follows because 



2Ef?5 i3 \p-q) (2.62) 
d 3 p 



2E i! 8 {3 \p-q) = l (2.63) 



So finally we learn that the relativistically normalized momentum states are given by 

\ P ) = y^Ep \p) = V^4l°) ( 2 - 64 ) 

Notice that our notation is rather subtle: the relativistically normalized momentum 
state \p) differs from \p) by the factor ^J2E^. These states now satisfy 

(p\q) = (27r) 3 2E p 8®(p-$) (2.65) 

Finally, we can rewrite the identity on one-particle states as 

d 3 p 1 



(2tt) 3 2E P - 



\p) (p\ (2.66) 

Some texts also define relativistically normalized creation operators by a){p) = ^2E^at. 
We won't make use of this notation here. 

2.5 Complex Scalar Fields 

Consider a complex scalar field ip(x) with Lagrangian 

C = <9 M V* d^tp - M^if; (2.67) 

Notice that, in contrast to the Lagrangian (1.7) for a real scalar field, there is no factor 
of 1/2 in front of the Lagrangian for a complex scalar field. If we write ip in terms 
of real scalar fields by ip = (0i + i02)/\ // 2, we get the factor of 1/2 coming from the 
l/\/2's. The equations of motion are 

i) + M 2 ^ = 
d^i/j* + M 2 f = (2.68) 

where the second equation is the complex conjugate of the first. We expand the complex 
field operator as a sum of plane waves as 

* ,= /(&vk^ e "' i+Cfe+ ' fi ) (2 - 69) 
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Since the classical field ip is not real, the corresponding quantum field ip is not hermitian. 
This is the reason that we have different operators b and c' appearing in the positive 
and negative frequency parts. The classical field momentum is tc = d£/dip = ip*. We 
also turn this into a quantum operator field which we write as, 

7T 
71-t 

The commutation relations between fields and momenta are given by 

[iP(x),7r(y)}=t6 ( - 3 \x-y) and [iP(x), ^(y)] = (2.71) 

together with others related by complex conjugation, as well as the usual [ip(x),ip(y)] = 
[ip(x),ip*(y)] = 0, etc. One can easily check that these field commutation relations are 
equivalent to the commutation relations for the operators and cp, 

[b p ,b^ = (2n) 3 S^(p-q) 

[c f ,c\] = (2*)H®{p-q) (2.72) 

and 

In summary, quantizing a complex scalar field gives rise to two creation operators, &t 
and ct. These have the interpretation of creating two types of particle, both of mass M 
and both spin zero. They are interpreted as particles and anti-particles. In contrast, 
for a real scalar field there is only a single type of particle: for a real scalar field, the 
particle is its own antiparticle. 

Recall that the theory (2.67) has a classical conserved charge 

Q = if d s X ^-W) = ij^W-^) (2.74) 
After normal ordering, this becomes the quantum operator 

Q = Iwf ^ ~ = Nc ~ Nb (2 - 75) 

so Q counts the number of anti-particles (created by c') minus the number of particles 
(created by 6+). We have [H,Q] = 0, ensuring that Q is conserved quantity in the 
quantum theory. Of course, in our free field theory this isn't such a big deal because 
both N c and 7V& are separately conserved. However, we'll soon see that in interacting 
theories Q survives as a conserved quantity, while N c and N b individually do not. 



d 3 p 
d 3 p 




-i) 



■" < b p e +if * - cte- if: 




(2.70) 
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2.6 The Heisenberg Picture 

Although we started with a Lorentz invariant Lagrangian, we slowly butchered it as we 
quantized, introducing a preferred time coordinate t. It's not at all obvious that the 
theory is still Lorentz invariant after quantization. For example, the operators <p{x) 
depend on space, but not on time. Meanwhile, the one-particle states evolve in time 
by Schrodinger's equation, 

% d\P^ = Hm) ^ m) = e - lEp t m (2?6) 

Things start to look better in the Heisenberg picture where time dependence is assigned 
to the operators O, 

O h = e lHt O s e- lHt (2.77) 

so that 



dt 



i[H,0 H ] (2.78) 



where the subscripts S and H tell us whether the operator is in the Schrodinger or 
Heisenberg picture. In field theory, we drop these subscripts and we will denote the 
picture by specifying whether the fields depend on space 4>(x) (the Schrodinger picture) 
or spacetime (f>(x,t) = (f>(x) (the Heisenberg picture). 

The operators in the two pictures agree at a fixed time, say, t = 0. The commutation 
relations (2.2) become equal time commutation relations in the Heisenberg picture, 

[(f)(x,t),(f)(y,t)} = [ir(x,t),Tr(y,t)}=0 

{<P(x,t),7r(y,t)}=i5W(x-y) (2.79) 

Now that the operator 4>(x) = 4>(x, t) depends on time, we can start to study how it 
evolves. For example, we have 

= i[H,<f>] = l -[j d 3 y vr(y) 2 + V0(y) 2 + m 2 0(y) 2 , <f>(x)] 

= i d 3 y ir(y) (-i) 5 (3) {y - x) = n(x) (2.80) 



Meanwhile, the equation of motion for n reads, 



7T = i[H, tt] = - [ / d 6 y ir(y) 2 + V0(y) 2 + m^y) 2 , n(x)\ 
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J d 3 y (v y [4>(y),7r(x)})vm + ^<f>(y)^My)M^)} 

+2im 2 (j)(y) 5 i3) (x-y) 
j d?y (V y 6W(x-#))V y <t>(y) \ -m 2 0(a;) 
= V 2 0-m 2 (2-81) 

where we've included the subscript y on V y when there may be some confusion about 
which argument the derivative is acting on. To reach the last line, we've simply inte- 
grated by parts. Putting (2.80) and (2.81) together we find that the field operator <fi 
satisfies the Klein-Gordon equation 

d^cf) + m 2 = (2.82) 

Things are beginning to look more relativistic. We can write the Fourier expansion of 
4>(x) by using the definition (2.77) and noting, 

e iHt a p e- iHt = e'^ 1 a$ and e iHt ale~ im = e +iE ^ at (2.83) 

which follows from the commutation relations [H, = —Epa^ and [H, at] = +£^at. 
This then gives, 

which looks very similar to the previous expansion (2.18) 
except that the exponent is now written in terms of 4- 
vectors, p ■ x = Ept — p ■ x. (Note also that a sign has 
flipped in the exponent due to our Minkowski metric con- 
traction). It's simple to check that (2.84) indeed satisfies 
the Klein-Gordon equation (2.82). 





2.6.1 Causality 



Figure 4: 



We're approaching something Lorentz invariant in the 
Heisenberg picture, where 4>(x) now satisfies the Klein- 
Gordon equation. But there's still a hint of non-Lorentz invariance because 
satisfy equal time commutation relations, 



and 7r 



[<j)(x,t),ir(y,t)] =i5^\x-y) 



(2.85) 
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But what about arbitrary spacetime separations? In particular, for our theory to be 
causal, we must require that all spacelike separated operators commute, 

[O 1 (x),O 2 (y)]=0 V (x-y) 2 <0 (2.86) 

This ensures that a measurement at x cannot affect a measurement at y when x and y 
are not causally connected. Does our theory satisfy this crucial property? Let's define 

A(x-y) = [<P(x),<P(y)} (2.87) 

The objects on the right-hand side of this expression are operators. However, it's easy 
to check by direct substitution that the left-hand side is simply a c-number function 
with the integral expression 

A(x - y) = J ±- (e-*-^) - e*-M) (2.88) 

What do we know about this function? 

• It's Lorentz invariant, thanks to the appearance of the Lorentz invariant measure 
/ d 3 p/2Ep that we introduced in (2.59). 

• It doesn't vanish for timelike separation. For example, taking x — y = (t, 0, 0, 0) 
gives [<j)(x, 0), <f>(x,t)] ~ e~ imt - e +imt . 

• It vanishes for space-like separations. This follows by noting that A(x — y) = 
at equal times for all (x — y) 2 = — (x — y) 2 < 0, which we can see explicitly by 
writing 

[0(x, t), m *)] = §/ ~7== 2 P M - e-*^) (2.89) 

and noticing that we can flip the sign of p in the last exponent as it is an inte- 
gration variable. But since A(x — y) Lorentz invariant, it can only depend on 
(x — y) 2 and must therefore vanish for all (x — y) 2 < 0. 

We therefore learn that our theory is indeed causal with commutators vanishing 
outside the lightcone. This property will continue to hold in interacting theories; indeed, 
it is usually given as one of the axioms of local quantum field theories. I should mention 
however that the fact that [<ft(x), 4>{y)} is a c-number function, rather than an operator, 
is a property of free fields only. 
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2.7 Propagators 

We could ask a different question to probe the causal structure of the theory. Prepare 
a particle at spacetime point y. What is the amplitude to find it at point xl We can 
calculate this: 

(0| |0) = / *0- 7 ^ s; (0| 1 0) e-+-~* 

= SwfW* l *~ ,] = D{x - v) (2M) 

The function D{x — y) is called the propagator. For spacelike separations, (x — y) 2 < 0, 
one can show that D(x — y) decays like 

D(x - y) ~ e^-^ (2.91) 

So it decays exponentially quickly outside the lightcone but, nonetheless, is non- vanishing 
The quantum field appears to leak out of the lightcone. Yet we've just seen that space- 
like measurements commute and the theory is causal. How do we reconcile these two 
facts? We can rewrite the calculation (2.89) as 

[0(x), <f>{y)\ = D(x -y)- D{y - x) = if (x - yf < (2.92) 

There are words you can drape around this calculation. When (x — y) 2 < 0, there 
is no Lorentz invariant way to order events. If a particle can travel in a spacelike 
direction from x — > y, it can just as easily travel from y — > x. In any measurement, the 
amplitudes for these two events cancel. 

With a complex scalar field, it is more interesting. We can look at the equation 
[il)(x),ij)'(y)] = outside the lightcone. The interpretation now is that the amplitude 
for the particle to propagate from x — > y cancels the amplitude for the antiparticle 
to travel from y — y x. In fact, this interpretation is also there for a real scalar field 
because the particle is its own antiparticle. 

2.7.1 The Feynman Propagator 

As we will see shortly, one of the most important quantities in interacting field theory 
is the Feynman propagator, 

{T)( x — y) x° > V° 

o o ( 2 - 93 ) 
D(y — x) y u > x u 
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where T stands for time ordering, placing all operators evaluated at later times to the 
left so, 



T<j>(x)<j>(y) 



<p{x)<p(y) 
<f>(y)<f>(x) 



x° > y° 
y° > x° 



(2.94) 



Claim: There is a useful way of writing the Feynman propagator in terms of a 4- 
momentum integral. 

d 4 p i 



A F (x - y) 



-ip-(x-y) 



(2.95) 



(2tt) 4 p 2 

Notice that this is the first time in this course that we've integrated over 4-momentum. 
Until now, we integrated only over 3-momentum, with p° fixed by the mass-shell con- 
dition to be p° = Ejj. In the expression (2.95) for Ap, we have no such condition on 
p°. However, as it stands this integral is ill-defined because, for each value of p, the 



denominator p 2 



■m 



0\2_ 



■p' 



■m 2 produces a pole when p° = ±Ep = ± \Jp 2 + m 2 . 



We need a prescription for avoiding these singularities in the po integral. To get the 
Feynman propagator, we must choose the contour to be 

Im(pQ) 



Re(p°) 





+E„ 



Figure 5: The contour for the Feynman propagator. 

Proof: 



(2.96) 



p 2 -m 2 (p°) 2 - E 2 (po _ e p ) (p° + E p ) 

so the residue of the pole at p° = ±Ep is ±l/2Ep. When x° > y°, we close the contour 
in the lower half plane, where p° — > —too, ensuring that the integrand vanishes since 
e -ip°(x°-y ) q rj^g integral over p° then picks up the residue at p° = +Ep which 
is —27ri/2Ef where the minus sign arises because we took a clockwise contour. Hence 
when x° > y° we have 
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(l "l> 1 e -iP-(*-y) = D ( x _ y ) (2.97) 



(2vr) 3 2E, 



p 



which is indeed the Feynman propagator for x° > y°. In contrast, when y° > x°, we 
close the contour in an anti-clockwise direction in the upper half plane to get, 

d 3 p 2iri . +iE ^ x o_ y o^ +i ^.^_^ 



A F (x — y) = / 7 — —r-. — ie 

' 1 (2tt) 4 {-2E£ 



d3 P 1 _- iE ^y -x )-ip-(y-£) 



e 



(2tt) 3 2E p 

e ^v\y-*) = D ( y _ x ) (2.98) 



d3 P 1 .-ip(y-x) 



(2tt) 3 2E p 

where to go to from the second line to the third, we have flipped the sign of p which 
is valid since we integrate over d 3 p and all other quantities depend only on p 2 . Once 
again we reproduce the Feynman propagator. □ 

Instead of specifying the contour, it is standard to write im(p°) 
the Feynman propagator as 

a / v f d A p i£*^» \+* Re(pO) 

A F (x -y)= yz rj — 2 YV~ 2 - 99 / r~ 

J (27t) 4 p z — m z + xe I -> e 

with e > 0, and infinitesimal. This has the effect of shifting 

the poles slightly off the real axis, so the integral along the 

real p° axis is equivalent to the contour shown in Figure 5. Figure 6. 

This way of writing the propagator is, for obvious reasons, 

called the u ie prescription". 

2.7.2 Green's Functions 

There is another avatar of the propagator: it is a Green's function for the Klein-Gordon 
operator. If we stay away from the singularities, we have 

{dl - V 2 + m 2 )A F (x -V)= [(% ~^— 2 (-P 2 + ™ 2 ) 

J (27TJ 4 p z — m z 

d 4 P --ivix-y) 



e 



(27T)* 

= -i5 {i \x-y) (2.100) 

Note that we didn't make use of the contour anywhere in this derivation. For some 
purposes it is also useful to pick other contours which also give rise to Green's functions. 
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Im(p») 



Re(pO) 



+E„ 



Im(pO) 



+E P Re(p») 



Figure 7: The retarded contour 



Figure 8: The advanced contour 



For example, the retarded Green's function Ar(x — y) is defined by the contour shown 
in Figure 7 which has the property 



A R (x -y) = 



D(x — y) — D(y — x) x° > y° 







y° >x° 



(2.101) 



The retarded Green's function is useful in classical field theory if we know the initial 
value of some field configuration and want to figure out what it evolves into in the 
presence of a source, meaning that we want to know the solution to the inhomogeneous 
Klein-Gordon equation, 



d^(f) + m 2 (f) = J(x) 



(2.102) 



for some fixed background function J(x). Similarly, one can define the advanced Green's 
function Aa(x — y) which vanishes when y° < x°, which is useful if we know the end 
point of a field configuration and want to figure out where it came from. Given that 
next term's course is called "Advanced Quantum Field Theory", there is an obvious 
name for the current course. But it got shot down in the staff meeting. In the quantum 
theory, we will see that the Feynman Green's function is most relevant. 



2.8 Non-Relativistic Fields 

Let's return to our classical complex scalar field obeying the Klein-Gordon equation. 
We'll decompose the field as 



^(x,t) = e" im V(f,t) 



Then the KG-equation reads 



-imt 



tp — 2imip — \7 2 ip 







(2.103) 



(2.104) 



with the m 2 term cancelled by the time derivatives. The non-relativistic limit of a 
particle is |pj m. Let's look at what this does to our field. After a Fourier transform, 
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this is equivalent to saying that <C m|-?/>|. In this limit, we drop the term with two 
time derivatives and the KG equation becomes, 

This looks very similar to the Schrodinger equation for a non-relativistic free particle of 
mass m. Except it doesn't have any probability interpretation — it's simply a classical 
field evolving through an equation that's first order in time derivatives. 

We wrote down a Lagrangian in section 1.1.2 which gives rise to field equations 
which are first order in time derivatives. In fact, we can derive this from the relativistic 
Lagrangian for a scalar field by again taking the limit dtip <C mip. After losing the 
tilde, so %jj — > if), the non-relativistic Lagrangian becomes 

C = +iib*iL - — (2.106) 
2m 

where we've divided by l/2m. This Lagrangian has a conserved current arising from 
the internal symmetry ip — > e ia ip. The current has time and space components 

f = ^(^W - W*)J (2.107) 

To move to the Hamiltonian formalism we compute the momentum 

dC 

7T = — ^ = «V* (2.108) 

di/j 

This means that the momentum conjugate to ip is iip*. The momentum does not depend 
on time derivatives at all! This looks a little disconcerting but it's fully consistent for a 
theory which is first order in time derivatives. In order to determine the full trajectory 
of the field, we need only specify ip and ip* at time t — 0: no time derivatives on the 
initial slice are required. 

Since the Lagrangian already contains a u pq" term (instead of the more familiar ^pq 
term), the time derivatives drop out when we compute the Hamiltonian. We get, 

n = — V^Vijj (2.109) 
2m 

To quantize we impose (in the Schrodinger picture) the canonical commutation relations 

li)(x)^(y)] = i^(x),^(y)] = 

mx),^(y)}=S i3) (x-y) (2.110) 
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We may expand i/j(x) Fourier transform 




ip-x 



(2.111) 



where the commutation relations (2.110) require 




(2.112) 



The vacuum satisfies a^\0) = 0, and the excitations are at 
states have energy 



at |0). The one-particle 



H\p) 



2m 



(2.113) 



which is the non-relativistic dispersion relation. We conclude that quantizing the first 
order Lagrangian (2.106) gives rise to non-relativistic particles of mass m. Some com- 
ments: 

• We have a complex field but only a single type of particle. The anti-particle is 
not in the spectrum. The existence of anti-particles is a consequence of relativity. 

• A related fact is that the conserved charge Q = J d 3 x : i/)'ip : is the particle 
number. This remains conserved even if we include interactions in the Lagrangian 
of the form 



So in non-relativistic theories, particle number is conserved. It is only with rela- 
tivity, and the appearance of anti-particles, that particle number can change. 

• There is no non-relativistic limit of a real scalar field. In the relativistic theory, 
the particles are their own anti-particles, and there can be no way to construct a 
multi-particle theory that conserves particle number. 

2.8.1 Recovering Quantum Mechanics 

In quantum mechanics, we talk about the position and momentum operators X and 
P. In quantum field theory, position is relegated to a label. How do we get back to 
quantum mechanics? We already have the operator for the total momentum of the field 



AC = V(if>*i>) 



(2.114) 




(2.115) 
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which, on one-particle states, gives P\p) = p\p). It's also easy to construct the position 
operator. Let's work in the non-relativistic limit. Then the operator 

^ffl= I 7^4e"* 2 (2-116) 



., (2tt) 3 p 

creates a particle with (^-function localization at x. We write \x) = i]fi(x) |0). A natural 
position operator is then 

X = J d 3 x x^{x)^{x) (2.117) 

so that X \x) = x\x). 

Let's now construct a state \ip) by taking superpositions of one-particle states \x), 

\cp) = J d 3 x ip(x) \x) (2.118) 

The function tp(x) is what we would usually call the Schrodinger wavefunction (in the 
position representation). Let's make sure that it indeed satisfies all the right properties. 
Firstly, it's clear that acting with the position operator X has the right action of (p(x), 

X i \<p) = J d 3 x x>(£) \x) (2.119) 

but what about the momentum operator P? We will now show that 

P>>= J d'x (-ij^) |5> (2-120) 

which tells us that P % acts as the familiar derivative on wavefunctions \ip). To see that 
this is the case, we write 

= / w^ e ^ >(a7) |0) (2,121) 

where we've used the relationship [a^tjr{x)\ = e~ ip ' x which can be easily checked. 
Proceeding with our calculation, we have 



d xd P e -v* ] „t 



(2tt) 3 V dxi 

. d(f 
dx 



4 io> 



d*x \.r) (2.122) 
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which confirms (2.120). So we learn that when acting on one-particle states, the oper- 
ators X and P act as position and momentum operators in quantum mechanics, with 
[X l ,P J ] \tp) = iS 1 ^ \cp). But what about dynamics? How does the wavefunction <p(x,t) 
change in time? The Hamiltonian (2.109) can be rewritten as 

H= [ A-J-VfV^ / ^^flU (2.123) 



2m T r J (2vr) 3 2m 



so we find that 



But this is the same equation obeyed by the original field (2.105)! Except this time, it 
really is the Schrodinger equation, complete with the usual probabilistic interpretation 
for the wavefunction tp. Note in particular that the conserved charge arising from the 
Noether current (2.107) is Q = J d 3 x \ip(x)\ 2 which is the total probability. 



Historically, the fact that the equation for the classical field (2.105) and the one- 
particle wavefunction (2.124) coincide caused some confusion. It was thought that 
perhaps we are quantizing the wavefunction itself and the resulting name "second quan- 
tization" is still sometimes used today to mean quantum field theory. It's important to 
stress that, despite the name, we're not quantizing anything twice! We simply quantize 
a classical field once. Nonetheless, in practice it's useful to know that if we treat the 
one-particle Schrodinger equation as the equation for a quantum field then it will give 
the correct generalization to multi-particle states. 



Interactions 

Often in quantum mechanics, we're interested in particles moving in some fixed back- 
ground potential V(x). This can be easily incorporated into field theory by working 
with a Lagrangian with explicit x dependence, 

C = ii/,*if,-—'Vil>*Vi/,-V(x)if>*i/> (2.125) 
2m 

Note that this Lagrangian doesn't respect translational symmetry and we won't have 
the associated energy-momentum tensor. While such Lagrangians are useful in con- 
densed matter physics, we rarely (or never) come across them in high-energy physics, 
where all equations obey translational (and Lorentz) invariance. 
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One can also consider interactions between particles. Obviously these are only impor- 
tant for n particle states with n > 2. We therefore expect them to arise from additions 
to the Lagrangian of the form 

AC = ip*{x) ip*(x) ip(x) ip(x) (2.126) 

which, in the quantum theory, is an operator which destroys two particles before creat- 
ing two new ones. Such terms in the Lagrangian will indeed lead to inter-particle forces, 
both in the non-relativistic and relativistic setting. In the next section we explore these 
types of interaction in detail for relativistic theories. 
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3. Interacting Fields 

The free field theories that we've discussed so far are very special: we can determine 
their spectrum, but nothing interesting then happens. They have particle excitations, 
but these particles don't interact with each other. 

Here we'll start to examine more complicated theories that include interaction terms. 
These will take the form of higher order terms in the Lagrangian. We'll start by asking 
what kind of small perturbations we can add to the theory. For example, consider the 
Lagrangian for a real scalar field, 

£=^W-imV-X;^T^ (3-1) 

n>3 

The coefficients A n are called coupling constants. What restrictions do we have on A n 
to ensure that the additional terms are small perturbations? You might think that we 
need simply make "A n <C 1". But this isn't quite right. To see why this is the case, let's 
do some dimensional analysis. Firstly, note that the action has dimensions of angular 
momentum or, equivalently, the same dimensions as h. Since we've set h = 1, using 
the convention described in the introduction, we have [S] = 0. With S = J d 4 x £, and 
[d^x] = -4, the La grangian density must therefore have 

[C\ = 4 (3.2) 

What does this mean for the Lagrangian (3.1)? Since [<9 M ] = 1, we can read off the 
mass dimensions of all the factors to find, 

[<f>] = l , [m] = l , [A„]=4-n (3.3) 

So now we see why we can't simply say we need A n < 1, because this statement only 
makes sense for dimensionless quantities. The various terms, parameterized by A n , fall 
into three different categories 

• [A3] = 1: For this term, the dimensionless parameter is X3/E, where E has 
dimensions of mass. Typically in quantum field theory, E is the energy scale of 
the process of interest. This means that A 3 ^ 3 /3! is a small perturbation at high 
energies E ^> A3, but a large perturbation at low energies £ C A 3 . Terms that 
we add to the Lagrangian with this behavior are called relevant because they're 
most relevant at low energies (which, after all, is where most of the physics we see 
lies). In a relativistic theory, E > m, so we can always make this perturbation 
small by taking A3 <C m. 
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• [AJ = 0: this term is small if A4 1. Such perturbations are called marginal. 

• [A n ] < for n > 5: The dimensionless parameter is (X n E n ~ A ), which is small at 
low-energies and large at high energies. Such perturbations are called irrelevant. 

As you'll see later, it is typically impossible to avoid high energy processes in quantum 
field theory. (We've already seen a glimpse of this in computing the vacuum energy). 
This means that we might expect problems with irrelevant operators. Indeed, these lead 
to "non-renormalizable" field theories in which one cannot make sense of the infinities 
at arbitrarily high energies. This doesn't necessarily mean that the theory is useless; 
just that it is incomplete at some energy scale. 

Let me note however that the naive assignment of relevant, marginal and irrelevant 
is not always fixed in stone: quantum corrections can sometimes change the character 
of an operator. 

An Important Aside: Why QFT is Simple 

Typically in a quantum field theory, only the relevant and marginal couplings are 
important. This is basically because, as we've seen above, the irrelevant couplings 
become small at low-energies. This is a huge help: of the infinite number of interaction 
terms that we could write down, only a handful are actually needed (just two in the 
case of the real scalar field described above). 

Let's look at this a little more. Suppose that we some day discover the true su- 
perduper "theory of everything unimportant" that describes the world at very high 
energy scales, say the GUT scale, or the Planck scale. Whatever this scale is, let's call 
it A. It is an energy scale, so [A] = 1. Now we want to understand the laws of physics 
down at our puny energy scale E <C A. Let's further suppose that down at the energy 
scale E, the laws of physics are described by a real scalar field. (They're not of course: 
they're described by non-Abelian gauge fields and fermions, but the same argument 
applies in that case so bear with me). This scalar field will have some complicated 
interaction terms (3.1), where the precise form is dictated by all the stuff that's going 
on in the high energy superduper theory. What are these interactions? Well, we could 
write our dimensionful coupling constants \ n in terms of dimensionless couplings g n , 
multiplied by a suitable power of the relevant scale A, 

= ^ti (3-4) 

The exact values of dimensionless couplings g n depend on the details of the high-energy 
superduper theory, but typically one expects them to be of order 1: g n ~ 0(1). This 
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means that for experiments at small energies £ C A, the interaction terms of the 
form <p n with n > 4 will be suppressed by powers of (E/A) n ~ 4 . This is usually a 
suppression by many orders of magnitude, (e.g for the energies E explored at the 
LHC, E/Mpi ~ 1CT 16 ). It is this simple argument, based on dimensional analysis, that 
ensures that we need only focus on the first few terms in the interaction: those which 
are relevant and marginal. It also means that if we only have access to low-energy 
experiments (which we do!), it's going to be very difficult to figure out the high energy 
theory (which it is!), because its effects are highly diluted except for the relevant and 
marginal interactions. The discussion given above is a poor man's version of the ideas 
of effective field theory and Wilson's renormalization group, about which you can learn 
more in the "Statistical Field Theory" course. 

Examples of Weakly Coupled Theories 

In this course we'll study only weakly coupled field theories i.e. ones that can truly be 
considered as small perturbations of the free field theory at all energies. In this section, 
we'll look at two types of interactions 

1) 4 theory: 

C = \d^d^ - l -m 2 <p 2 - ^ (3.5) 

with Acl. We can get a hint for what the effects of this extra term will be. Expanding 
out 4 in terms of a- and at, we see a sum of interactions that look like 

a f a p a p a p ancl a p a p a p a p etc - ( 3 - 6 ) 

These will create and destroy particles. This suggests that the 4 Lagrangian describes 
a theory in which particle number is not conserved. Indeed, we could check that the 
number operator iV now satisfies [H, N] ^ 0. 

2) Scalar Yukawa Theory 

C = d^d^ip + -d^cj) - M 2 if>*if> - ^m 2 (f) 2 - gip*ip(f> (3.7) 

with g <C M, m. This theory couples a complex scalar %p to a real scalar cf). While 
the individual particle numbers of ip and <fi are no longer conserved, we do still have 
a symmetry rotating the phase of ip, ensuring the existence of the charge Q defined 
in (2.75) such that [Q, H] = 0. This means that the number of ip particles minus the 
number of ip anti-particles is conserved. It is common practice to denote the anti- 
particle as ty. 
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The scalar Yukawa theory has a slightly worrying aspect: the potential has a stable 
local minimum at = ip — 0, but is unbounded below for large enough —g<j). This 
means we shouldn't try to push this theory too far. 

A Comment on Strongly Coupled Field Theories 

In this course we restrict attention to weakly coupled field theories where we can use 
perturbative techniques. The study of strongly coupled field theories is much more 
difficult, and one of the major research areas in theoretical physics. For example, some 
of the amazing things that can happen include 

• Charge Fractionalization: Although electrons have electric charge 1, under 
the right conditions the elementary excitations in a solid have fractional charge 
1/N (where N e 2Z + 1). For example, this occurs in the fractional quantum 
Hall effect. 

• Confinement: The elementary excitations of quantum chromodynamics (QCD) 
are quarks. But they never appear on their own, only in groups of three (in a 
baryon) or with an anti-quark (in a meson). They are confined. 

• Emergent Space: There are field theories in four dimensions which at strong 
coupling become quantum gravity theories in ten dimensions! The strong cou- 
pling effects cause the excitations to act as if they're gravitons moving in higher 
dimensions. This is quite extraordinary and still poorly understood. It's called 
the AdS/CFT correspondence. 

3.1 The Interaction Picture 

There's a useful viewpoint in quantum mechanics to describe situations where we have 
small perturbations to a well- understood Hamiltonian. Let's return to the familiar 
ground of quantum mechanics with a finite number of degrees of freedom for a moment. 
In the Schrodinger picture, the states evolve as 

i^=ff|*>. (3-8) 

while the operators Os are independent of time. 

In contrast, in the Heisenberg picture the states are fixed and the operators change 
in time 

H (t) = e iHt O s e- iHt 
\1>)h = e %Ht m s (3-9) 
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The interaction picture is a hybrid of the two. We split the Hamiltonian up as 



H = H + H int (3.10) 

The time dependence of operators is governed by H , while the time dependence of 
states is governed by H mt . Although the split into H and ifi nt is arbitrary, it's useful 
when H is soluble (for example, when H is the Hamiltonian for a free field theory). 
The states and operators in the interaction picture will be denoted by a subscript / 
and are given by, 

= e iHot O s e- iHot (3.11) 

This last equation also applies to H int , which is time dependent. The interaction 
Hamiltonian in the interaction picture is, 

Hi = {H- mt )i = e lH °\H- mt ) s e""™ (3.12) 

The Schrodinger equation for states in the interaction picture can be derived starting 
from the Schrodinger picture 



'i 



dt dt 

-e^H-^se^^), (3.13) 



So we learn that 



dt 



i^=W)W)i (3-14) 



3.1.1 Dyson's Formula 



"Well, Birmingham has much the best theoretical physicist to work with, 
Peierls; Bristol has much the best experimental physicist, Powell; Cam- 
bridge has some excellent architecture. You can make your choice." 

Oppenheimer's advice to Dyson on which university position to accept. 
We want to solve (3.14). Let's write the solution as 

= U(t,t )\i>(t )) I (3.15) 
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where U (t, t ) is a unitary time evolution operator such that U(t 1 ,t 2 )U(t 2 , t 3 ) = U (ti,t 3 ) 
and U(t,t) = 1. Then the interaction picture Schrodinger equation (3.14) requires that 

dll , s „ 
i— = Hj(t)U (3.16) 

If Hi were a function, then we could simply solve this by 

U(t, t ) = exp (^-i jf" /fj(t') dt'^j (3.17) 

But there's a problem. Our Hamiltonian Hj is an operator, and we have ordering 
issues. Let's see why this causes trouble. The exponential of an operator is defined in 
terms of the expansion, 

exp(-i£ Htf) dt'^j = 1 - % jf H I (t')dt' + H^dt^j +...(3.18) 

But when we try to differentiate this with respect to t, we find that the quadratic term 
gives us 

~\ (jT - ^/(*) (jT Hj(t') dt'^j (3.19) 

Now the second term here looks good, since it will give part of the Hj(t)U that we 
need on the right-hand side of (3.16). But the first term is no good since the Hj{t) 
sits the wrong side of the integral term, and we can't commute it through because 
[H lit 1 ), H i{t)] 7^ when t! ^ t. So what's the way around this? 

Claim: The solution to (3.16) is given by Dyson's Formula. (Essentially first figured 
out by Dirac, although the compact notation is due to Dyson). 

U(t, t ) = T exp (-i J Hi{t') dt'^j (3.20) 

where T stands for time ordering where operators evaluated at later times are placed 
to the left 

f 1 (t 1 )0 2 {to) t 1 >t 2 
T(0 1 (t 1 )0 2 (t 2 )) = \ Ui; 2K2> (3.21) 
(0 2 (t 2 )0 1 (t 1 ) t 2 >h 

Expanding out the expression (3.20), we now have 

-t i -\2 r /•* f* 



U(t,t ) = 1-i I dt'Hi{t') + ^- I dt' I dt" Hi{t")Hi{t') 

Jt ^ Ut Jt' 

+ f dt' f dt" Hi(t')Hj(t") 

Jtn Jto 
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Actually these last two terms double up since 

f dt' f dt" Htf^Htf) = f dt" f dt' H^H^t') 

Jt Jt' Jto Jt 

= f dt' t dt" H^H^t") (3.22) 

Jto Jto 

where the range of integration in the first expression is over t" > t', while in the second 
expression it is t' < t" which is, of course, the same thing. The final expression is the 
same as the second expression by a simple relabelling. This means that we can write 

rt rt ft' 

U(t,t ) = l-i dt'H^t') + {-if / dt' / dt" H^H^t") + . . . (3.23) 

Jto J to J to 



Proof: The proof of Dyson's formula is simpler than explaining what all the nota- 
tion means! Firstly observe that under the T sign, all operators commute (since their 
order is already fixed by the T sign). Thus 



d 
l dt 



Texp^-i^ dt' =T H^t) exp (^-i dt' 

= Hi(t) T exp ^ -i dt' H^t')^ (3.24) 

since t, being the upper limit of the integral, is the latest time so Hjit) can be pulled 
out to the left. □ 

Before moving on, I should confess that Dyson's formula is rather formal. It is 
typically very hard to compute time ordered exponentials in practice. The power of 
the formula comes from the expansion which is valid when Hj is small and is very easily 
computed. 

3.2 A First Look at Scattering 

Let us now apply the interaction picture to field theory, starting with the interaction 
Hamiltonian for our scalar Yukawa theory, 

#int = gjd 3 x (3-25) 

Unlike the free theories discussed in Section 2, this interaction doesn't conserve particle 
number, allowing particles of one type to morph into others. To see why this is, we use 
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the interaction picture and follow the evolution of the state: \if)(t)) = U(t,t ) \ip(t )), 
where £/(t,to) is given by Dyson's formula (3.20) which is an expansion in powers of 
H mt . But H mt contains creation and annihilation operators for each type of particle. 
In particular, 

• ~ a + This operator can create or destroy particles. Let's call them 
mesons. 

• ip ~ b + c>: This operator can destroy ip particles through b, and create anti- 
particles through cL Let's call these particles nucleons. Of course, in reality 
nucleons are spin 1/2 particles, and don't arise from the quantization of a scalar 
field. But we'll treat our scalar Yukawa theory as a toy model for nucleons 
interacting with mesons. 

• ^' ~ b' + c: This operator can create nucleons through tf, and destroy anti- 
nucleons through c. 

Importantly, Q = N c — Nb remains conserved in the presence of H mt . At first order in 
perturbation theory, we find terms in H int like c^b^a. This kills a meson, producing a 
nucleon-anti-nucleon pair. It will contribute to meson decay <ft — y ipip. 

At second order in perturbation theory, we'll have more complicated terms in (H int ) 2 , 
for example (^^^(cba^). This term will give contributions to scattering processes 
ipip y (j) y ipxf;. The rest of this section is devoted to computing the quantum ampli- 
tudes for these processes to occur. 

To calculate amplitudes we make an important, and slightly dodgy, assumption: 

Initial and final states are eigenstates of the free theory 

This means that we take the initial state \i) at t — > — oo, and the final state |/) at 
t — > +oo, to be eigenstates of the free Hamiltonian H . At some level, this sounds 
plausible: at t — > — oo, the particles in a scattering process are far separated and don't 
feel the effects of each other. Furthermore, we intuitively expect these states to be 
eigenstates of the individual number operators N, which commute with Hq, but not 
if int . As the particles approach each other, they interact briefly, before departing again, 
each going on its own merry way. The amplitude to go from \i) to |/) is 

lim (f\U(t + ,t-)\i) = {f\S\i) (3.26) 

t±— >±oo 

where the unitary operator S is known as the S-matrix. (S is for scattering). There 
are a number of reasons why the assumption of non-interacting initial and final states 
is shaky: 
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• Obviously we can't cope with bound states. For example, this formalism can't 
describe the scattering of an electron and proton which collide, bind, and leave 
as a Hydrogen atom. It's possible to circumvent this objection since it turns out 
that bound states show up as poles in the S-matrix. 

• More importantly, a single particle, a long way from its neighbors, is never alone 
in field theory. This is true even in classical electrodynamics, where the electron 
sources the electromagnetic field from which it can never escape. In quantum 
electrodynamics (QED), a related fact is that there is a cloud of virtual photons 
surrounding the electron. This line of thought gets us into the issues of renormal- 
ization — more on this next term in the "AQFT" course. Nevertheless, motivated 
by this problem, after developing scattering theory using the assumption of non- 
interacting asymptotic states, we'll mention a better way. 

3.2.1 An Example: Meson Decay 

Consider the relativistically normalized initial and final states, 

|*> = 1°) 

\f) = v^Ep^6t, c t, )0) (3 . 27) 

The initial state contains a single meson of momentum p; the final state contains a 
nucleon-anti-nucleon pair of momentum qi and q<i- We may compute the amplitude for 
the decay of a meson to a nucleon-anti-nucleon pair. To leading order in g, it is 

(/| S \i) = -ig (f\ J d A x tf{x)il>(x)<f>(x) \i) (3.28) 

Let's go slowly. We first expand out ~ a + a) using (2.84). (Remember that the <fi in 
this formula is in the interaction picture, which is the same as the Heisenberg picture 
of the free theory). The a piece will turn \i) into something proportional to |0), while 
the a) piece will turn \i) into a two meson state. But the two meson state will have 
zero overlap with (/|, and there's nothing in the ip and ip^ operators that lie between 
them to change this fact. So we have 

/f H 3 k \ /2E- 

= -ig(f\ J d 4 x^{x)ip{x)e- ip - x \0) (3.29) 

where, in the second line, we've commuted ag past at, picking up a 5( 3 \p — k) delta- 
function which kills the d 3 k integral. We now similarly expand out ip ~ b + and 
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~ + c. To get non-zero overlap with (/|, only the and contribute, for they 
create the nucleon and anti-nucleon from |0). We then have 



and so we get our first quantum field theory amplitude. 

Notice that the 5-function puts constraints on the possible decays. In particular, the 
decay only happens at all if to > 2M. To see this, we may always boost ourselves 
to a reference frame where the meson is stationary, so p = (to, 0,0,0). Then the 
delta function imposes momentum conservation, telling us that <fi = — g*2 and to = 



Later you will learn how to turn this quantum amplitude into something more phys- 
ical, namely the lifetime of the meson. The reason this is a little tricky is that we must 
square the amplitude to get the probability for decay, which means we get the square 
of a 5-function. We'll explain how to deal with this in Section 3.6 below, and again in 
next term's "Standard Model" course. 

3.3 Wick's Theorem 

From Dyson's formula, we want to compute quantities like (/| T {H I (x 1 ) . . . Hj(x n )} 
where \i) and |/) are eigenstates of the free theory. The ordering of the operators is fixed 
by T, time ordering. However, since the H/s contain certain creation and annihilation 
operators, our life will be much simpler if we can start to move all annihilation operators 
to the right where they can start killing things in Recall that this is the definition 
of normal ordering. Wick's theorem tells us how to go from time ordered products to 
normal ordered products. 

3.3.1 An Example: Recovering the Propagator 

Let's start simple. Consider a real scalar field which we decompose in the Heisenberg 
picture as 




X 



ig {2nf5 {4 \ qi + q 2 -p) 



(3.30) 



2v/M 2 + |gl 2 . 



<f>(x) = (j) + (x) + (j)-(x) 



(3.31) 



where 




(3.32) 
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where the ± signs on (f) make little sense, but apparently you have Pauli and Heisenberg 
to blame. (They come about because + ~ e~ tEt , which is sometimes called the positive 
frequency piece, while 0~ ~ e +iEt - ls ^ e negative frequency piece). Then choosing 
x° > y°, we have 

T 4>{x)4>{y) = <j)(x)(f)(y) 

= (0 + (x) + r{xm + {y) + r(v)) (3.33) 

= + (x)(f) + (y) + (P~(x)(f) + (y) + (p-(y)(f) + (x) + [(f> + (x), <fr(y)} + <F {*)<!>' (v) 

where the last line is normal ordered, and for our troubles we have picked up a factor 
of D(x — y) = [(f> + (x), <j)~{y)\ which is the propagator we met in (2.90). So for x° > y° 
we have 

T <j>{x)<f>{y) =: <t>{x)<t>{y) : + D(x - y) (3.34) 
Meanwhile, for y° > x°, we may repeat the calculation to find 

T <t>{x)<t>{y) =: <f>{x)<j>{y) : + D(y - x) (3.35) 

So putting this together, we have the final expression 

T </>(x)(f>(y) =: (j){x)(t>{y) : + A F (x - y) (3.36) 

where Ap(x — y) is the Feynman propagator defined in (2.93), for which we have the 
integral representation 

/d A k i e ik-(x-y) 
(27t) 4 k z — in 2 + xe 

Let me reiterate a comment from Section 2: although T (p(x)<p(y) and : <fi(x)(j)(y) : are 
both operators, the difference between them is a c- number function, Ap{x — y). 

Definition: We define the contraction of a pair of fields in a string of operators 
. . . <j){xi) . . . <j){x-2) ... to mean replacing those operators with the Feynman propaga- 
tor, leaving all other operators untouched. We use the notation, 



...0(x!)...0(x 2 )... (3.38) 
to denote contraction. So, for example, 

%{x)^y) = A F {x - y) (3.39) 
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A similar discussion holds for complex scalar fields. We have 

Tip(x)^(y) =: *fi(x)^(y) = +&f(x - y) (3.40) 
prompting us to define the contraction 



ip(x)ip'(y) = A F (x — y) and tfi(x)ip(y) = ifi^ (x)ifi\y) = (3-41) 

3.3.2 Wick's Theorem 

For any collection of fields 0i = 0(xi), 2 = 0(x 2 ), etc, we have 

T(0! . . . (p n ) =: (pi . . . (fi n : + : all possible contractions : (3.42) 

To see what the last part of this equation means, let's look at an example. For n — 4, 
the equation reads 

T(0i0 2 030 4 ) = : 01020304 : +0102 : 0304 : +0103 : 0204 : + four similar terms 

+ 0102 0304 + 0103 0204 + 0104 0203 (3-43) 

Proof: The proof of Wick's theorem proceeds by induction and a little thought. It's 
true for n — 2. Suppose it's true for 02 . . . 0„ and now add (fi\. We'll take x® > x° k 
for all k — 2, . . . , n. Then we can pull 0i out to the left of the time ordered product, 
writing 

T (0i02 .. .(fi n ) = ((fii + (fii) (: 02 . • • 0„ : + : contractions :) (3.44) 

The (fii term stays where it is since it is already normal ordered. But in order to write 
the right-hand side as a normal ordered product, the 0^ term has to make its way 
past the crowd of 0^ operators. Each time it moves past 0^, we pick up a factor of 

010/c = Ap(xi — Xk) from the commutator. (Try it!) □ 

3.3.3 An Example: Nucleon Scattering 

Let's look at ipip — > ifiifi scattering. We have the initial and final states 

\i) = y/2^y/2E^b^ 6t |0> = \ Pl ,p 2 ) 

We can then look at the expansion of (/| S\i). In fact, we really want to calculate 
(/| S — 1 \i) since we're not interested in situations where no scattering occurs. At 
order g 2 we have the term 

J d 4 Xl d A x 2 T (^(x 1 )^(x 1 )(fi(x 1 )^(x 2 )^(x 2 )(fi(x 2 )) (3.46) 
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Now, using Wick's theorem we see there is a piece in the string of operators which looks 
like 

: ip 1 {x 1 )i!{x 1 )i! 1 {x2)ip{x 2 ) : 0(xi)0(a; 2 ) (3.47) 

which will contribute to the scattering because the two ip fields annihilate the ip parti- 
cles, while the two i\y fields create ip particles. Any other way of ordering the ip and 
ifi fields will give zero contribution. This means that we have 

= (p[,P 2 \ ^(xi)^ (x 2 ) |0) (0\ip(x 1 )ip(x 2 ) \pi,Ps) 

— ^giPi'^l+iP^'^ _|_ e ip' 1 -X2+ip' 2 -xi^j ^ e -ipi-Xi—ip2-X2 _|_ e -ipi-X2-ip2-xi*j 

— e ixi-(p'i-Pl)+ix2-(p' 2 -p2) _|_ e ixi-(p' 2 -pi)+ix 2 -(p' 1 -p2) _|_ ^ x \ (3.48) 

where, in going to the third line, we've used the fact that for relativistically normalized 

states, 

(0\i;(x)\ P ) = e - tp - x (3.49) 
Now let's insert this into (3.46), to get the expression for (/| 5* \i) at order g 2 , 

J d ^ X2 [e-- + e-- + („ „ *)] / ^ (3.50) 

where the expression in square brackets is (3.48), while the final integral is the 
propagator which comes from the contraction in (3.47). Now the (x\ -H- x 2 ) terms 
double up with the others to cancel the factor of 1/2 out front. Meanwhile, the x\ and 
x 2 integrals give delta-functions. We're left with the expression 

+ ^ (4 V 2 -Pi + k) 5 {4) (K -Pi- k)] (3.51) 
Finally, we can trivially do the d A k integral using the delta-functions to get 

(2tt) 4 5( 4 ) ( Pl+P2 -p' 1 -p' 2 ) 



-igf 



(pi — Pi) 2 — rn 2 + 



16 



(Pi 



Ptf 



77r 



ie 



In fact, for this process we may drop the +ie terms since the denominator is never 
zero. To see this, we can go to the center of mass frame, where p\ = —p 2 and, by 



momentum conservation, \pi\ = \pi\. This ensures that the 4- momentum of the meson 
is k = (0,p — p'), so k 2 < 0. We therefore have the end result, 



i 

+ 



(pi — p{) 2 — m 2 (pi — P2) 2 — m 2 



(2n)U^{p 1 +p 2 -p' 1 -p' 2 ) (3.52) 



We will see another, much simpler way to reproduce this result shortly using Feynman 
diagrams. This will also shed light on the physical interpretation. 

This calculation is also relevant for other scattering processes, such as ipip — > ijrij), 
ipip — > Each of these comes from the term (3.48) in Wick's theorem. However, we 
will never find a term that contributes to scattering ipip — > ijj'ijj, for this would violate 
the conservation of Q charge. 

Another Example: Meson-Nucleon Scattering 

If we want to compute ipcf) — > x/xf) scattering at order g 2 , we would need to pick out the 
term 



: i) ] (xi)(j)(xi)i)(x2)(l){x2) ■ ip(x 1 )ip^(x 2 ) (3.53) 

and a similar term with ip and ip^ exchanged. Once more, this term also contributes to 
similar scattering processes, including ipip — > tjxft and (jxp — > ipip. 

3.4 Feynman Diagrams 

"Like the silicon chips of more recent years, the Feynman diagram was 

bringing computation to the masses." T ,. n , . 

Julian bchwinger 

As the above example demonstrates, to actually compute scattering amplitudes using 
Wick's theorem is rather tedious. There's a much better way. It requires drawing pretty 
pictures. These pictures represent the expansion of (f\S\i) and we will learn how to 
associate numbers (or at least integrals) to them. These pictures are called Feynman 
diagrams. 

The object that we really want to compute is (/| S — 1 since we're not interested in 
processes where no scattering occurs. The various terms in the perturbative expansion 
can be represented pictorially as follows 

• Draw an external line for each particle in the initial state \i) and each particle 
in the final state |/). We'll choose dotted lines for mesons, and solid lines for 
nucleons. Assign a directed momentum p to each line. Further, add an arrow to 
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solid lines to denote its charge; we'll choose an incoming (outgoing) arrow in the 
initial state for ip We choose the reverse convention for the final state, where 
an outgoing arrow denotes ip. 



• Join the external lines together with trivalent vertices , 




Each such diagram you can draw is in 1-1 correspondence with the terms in the 
expansion of (f\S — l\i). 

3.4.1 Feynman Rules 

To each diagram we associate a number, using the Feynman rules 

• Add a momentum k to each internal line 

• To each vertex, write down a factor of 



where ki is the sum of all momenta flowing into the vertex. 

• For each internal dotted line, corresponding to a particle with momentum k, 
we write down a factor of 



We include the same factor for solid internal ip lines, with m replaced by the 
nucleon mass M. 




(3.54) 




(3.55) 
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3.5 Examples of Scattering Amplitudes 

Let's apply the Feynman rules to compute the amplitudes for various processes. We 
start with something familiar: 

Nucleon Scattering Revisited 

Let's look at how this works for the %pip — > ipip scattering at order g 2 . We can write 
down the two simplest diagrams contributing to this process. They are shown in Figure 
9. 




Figure 9: The two lowest order Feynman diagrams for nucleon scattering. 

Applying the Feynman rules to these diagrams, we get 
1 1 



K-igY 



(pi ~ p[) 2 ~~ m2 (pi ~ p'2) 2 ~~ m2 



(2 7 :Y6^\p 1 +p 2 -p' 1 ~p' 2 ) (3.56) 



which agrees with the calculation (3.51) that we performed earlier. There is a nice 
physical interpretation of these diagrams. We talk, rather loosely, of the nucleons 
exchanging a meson which, in the first diagram, has momentum k = (pi—p'i) = {jp2—p l 2)- 
This meson doesn't satisfy the usual energy dispersion relation, because k 2 7^ m 2 : the 
meson is called a virtual particle and is said to be off-shell (or, sometimes, off mass- 
shell). Heuristically, it can't live long enough for its energy to be measured to great 
accuracy. In contrast, the momentum on the external, nucleon legs all satisfy p 2 = M 2 , 
the mass of the nucleon. They are on-shell. One final note: the addition of the two 
diagrams above ensures that the particles satisfy Bose statistics. 

There are also more complicated diagrams which will contribute to the scattering 
process at higher orders. For example, we have the two diagrams shown in Figures 
10 and 11, and similar diagrams with p[ and p' 2 exchanged. Using the Feynman rules, 
each of these diagrams translates into an integral that we will not attempt to calculate 
here. And so we go on, with increasingly complicated diagrams, all appearing at higher 
order in the coupling constant g. 
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Figure 10: A contribution at 0(g 4 ). 



i 

o 




Figure 11: A contribution at 0(g 6 ) 



Amplitudes 

Our final result for the nucleon scattering amplitude {f\S — 1 \i) at order g 2 was 



-igf 



+ 



(pi — p'i) 2 — m2 (pi — p'2) 2 ~ m 



The 5-function follows from the conservation of 4-momentum which, in turn, follows 
from spacetime translational invariance. It is common to all S-matrix elements. We will 
define the amplitude Afi by stripping off this momentum-conserving delta-function, 



(f\S-l\i)=iA fi (2n) 4 S^( PF - Pl ) 



(3.57) 



where P i (pp) is the sum of the initial (final) 4-momenta, and the factor of % out front 
is a convention which is there to match non-relativistic quantum mechanics. We can 
now refine our Feynman rules to compute the amplitude iAf% itself: 

• Draw all possible diagrams with appropriate external legs and impose 4-momentum 
conservation at each vertex. 



• Write down a factor of (—ig) at each vertex. 

• For each internal line, write down the propagator 

• Integrate over momentum k flowing through each loop f <i 4 £;/(27r) 4 . 

This last step deserves a short explanation. The diagrams we've computed so far have 
no loops. They are tree level diagrams. It's not hard to convince yourself that in 
tree diagrams, momentum conservation at each vertex is sufficient to determine the 
momentum flowing through each internal line. For diagrams with loops, such as those 
shown in Figures 10 and 11, this is no longer the case. 
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Figure 12: The two lowest order Feynman diagrams for nucleoli to meson scattering. 



Nucleon to Meson Scattering 

Let's now look at the amplitude for a nucleon-anti- nucleon pair to annihilate into a 
pair of mesons: ip'tp — > cfxf). The simplest Feynman diagrams for this process are shown 
in Figure 12 where the virtual particle in these diagrams is now the nucleon ip rather 
than the meson <fi. This fact is reflected in the denominator of the amplitudes which 
are given by 



iA = (-igf 



M 2 



(3.58) 



_(pi-p[) 2 -M 2 (pi-p^) 2 

As in (3.52), we've dropped the ie from the propagators as the denominator never 
vanishes. 



Nucleon-Anti-Nucleon Scattering 







Figure 13: The two lowest order Feynman diagrams for nucleon-anti-nucleon scattering. 

For the scattering of a nucleon and an anti-nucleon, ipxf; — > tp'ip, the Feynman 
diagrams are a little different. At lowest order, they are given by the diagrams of 
Figure 13. It is a simple matter to write down the amplitude using the Feynman rules, 



(Pi — Pi) 2 — m2 (Pi + P2) 2 — m 2 + ie 



(3.59) 
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Notice that the momentum dependence in the sec- 
ond term is different from that of nucleon-nucleon 
scattering (3.56), reflecting the different Feynman di- 
agram that contributes to the process. In the center 
of mass frame, p\ = —P2, the denominator of the sec- 
ond term is 4(M 2 +pf) — m 2 . If m < 2M, then this 
term never vanishes and we may drop the ie. In con- 
trast, if m > 2M, then the amplitude corresponding 
to the second diagram diverges at some value of p. 
In this case it turns out that we may also neglect the 
ie term, although for a different reason: the meson 
is unstable when m > 2M, a result we derived in 
(3.30). When correctly treated, this instability adds 
a finite imaginary piece to the denominator which 

overwhelms the ie. Nonetheless, the increase in the scattering amplitude which we see 




Figure 14: 



in the second diagram when 4(M 2 + p' 



m 



2 : 



is what allows us to discover new parti- 



cles: they appear as a resonance in the cross section. For example, the Figure 14 shows 
the cross-section (roughly the amplitude squared) plotted vertically for e + e~ — > fi + fi~ 
scattering from the ALEPH experiment in CERN. The horizontal axis shows the center 
of mass energy. The curve rises sharply around 91 GeV, the mass of the Z-boson. 



Meson Scattering 

For 00 — > 00, the simplest diagram we can write 
down has a single loop, and momentum conservation at 
each vertex is no longer sufficient to determine every 
momentum passing through the diagram. We choose 
to assign the single undetermined momentum k to the 
right-hand propagator. All other momenta are then de- 
termined. The amplitude corresponding to the diagram 
shown in the figure is 



-wY 



d A k 



1 



(2?r) 4 (k 2 - M 2 + ie)((k + p[) 2 

x 



M 2 + ie) 
1 




Figure 15: 



((fc +p{- pi) 2 — M 2 + ie)((k - p' 2 ) 2 - M 2 



ie 



These integrals can be tricky. For large k, this integral goes as J d A k/k 8 , which is at 
least convergent as k — > oo. But this won't always be the case! 
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3.5.1 Mandelstam Variables 

We see that in many of the amplitudes above — in particular those that include the 
exchange of just a single particle — the same combinations of momenta are appear- 
ing frequently in the denominators. There are standard names for various sums and 
differences of momenta: they are known as Mandelstam variables. They are 



s = ( Pl+P2 ) 2 = {p> x 


+ P 2 ) 2 




t = ( Pl -p[) 2 = (pjj 


-P2) 2 


(3.60) 


u = (pi-p' 2 ) 2 = (pa 







where, as in the examples above, p\ and P2 are the momenta of the two initial particles, 
and p[ and p' 2 are the momenta of the final two particles. We can define these variables 
whether the particles involved in the scattering are the same or different. To get a feel 
for what these variables mean, let's assume all four particles are the same. We sit in 
the center of mass frame, so that the initial two particles have four-momenta 

Pl = (E, 0, 0, p) and p 2 = (E, 0, 0, -p) (3.61) 

The particles then scatter at some angle 9 and leave with momenta 

p[ = (E, 0,psm9,pcos9) and p' 2 = (E, 0, — psin9, — pcos9) (3.62) 

Then from the above definitions, we have that 

s = AE 2 and t = -2p 2 (l - cos9) and u = -2p 2 (l + cos 9) (3.63) 

The variable s measures the total center of mass energy of the collision, while the 
variables t and u are measures of the momentum exchanged between particles. (They 
are basically equivalent, just with the outgoing particles swapped around). Now the 
amplitudes that involve exchange of a single particle can be written simply in terms of 
the Mandelstam variables. For example, for nucleon-nucleon scattering, the amplitude 
(3.56) is schematically A ~ (t — m 2 )' 1 + (u — m 2 )~ l . For the nucleon-anti-nucleon 
scattering, the amplitude (3.59) is A ~ (t — m 2 )^ 1 + (s — m 2 )" 1 . We say that the 
first case involves "t-channel" and "u-channel" diagrams. Meanwhile the nucleon-anti- 
nucleon scattering is said to involve "t-channel" and "s-channel" diagrams. (The first 
diagram indeed includes a vertex that looks like the letter "T"). 

Note that there is a relationship between the Mandelstam variables. When all the 
masses are the same we have s + t + u = AM 2 . When the masses of all 4 particles differ, 
this becomes s + t + u = M 2 . 



3.5.2 The Yukawa Potential 

So far we've computed the quantum amplitudes for various scattering processes. But 
these quantities are a little abstract. In Section 3.6 below (and again in next term's 
"Standard Model" course) we'll see how to turn amplitudes into measurable quantities 
such as cross-sections, or the lifetimes of unstable particles. Here we'll instead show 
how to translate the amplitude (3.52) for nucleon scattering into something familiar 
from Newtonian mechanics: a potential, or force, between the particles. 

Let's start by asking a simple question in classical field theory that will turn out to 
be relevant. Suppose that we have a fixed 5-function source for a real scalar field (f>, 
that persists for all time. What is the profile of 4>(x)? To answer this, we must solve 
the static Klein-Gordon equation, 

-V 2 (j) + m 2 (f) = 6 {3) (x) (3.64) 

We can solve this using the Fourier transform, 

d 3 k - . . 

Plugging this into (3.64) tells us that {k 2 + m 2 )0(/c) = 1, giving us the solution 



7^-4 ^ (3-66) 

(27r)3 fc2 + m 2 1 > 

Let's now do this integral. Changing to polar coordinates, and writing k • x — kr cos 9, 
we have 

1 f°° „ k 2 2 sin kr 

4>\x) = -, — T77 / dk — — 

YK ' (2vr) 2 J k 2 + m 2 kr 

1 f +OQ A; sin kr 
dk 



(2ir) 2 r J„ 00 k 2 + m 2 



1 -Re 



2nr 



+°° dk ke ikr 

.„ 2~7ri k 2 + m 2 ^ ' 



We compute this last integral by closing the contour in the upper half plane k — > +ioo, 
picking up the pole at k = +im. This gives 

Mx) = -L e~ mr (3.68) 

The field dies off exponentially quickly at distances 1/m, the Compton wavelength of 
the meson. 
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Now we understand the profile of the <p field, what does this have to do with the force 
between ip particles? We do very similar calculations to that above in electrostatics 
where a charged particle acts as a 5-function source for the gauge potential: — V 2 A = 
S^(x), which is solved by Aq = 1/Aixr. The profile for Aq then acts as the potential 
energy for another charged (test) particle moving in this background. Can we give the 
same interpretation to our scalar field? In other words, is there a classical limit of the 
scalar Yukawa theory where the ip particles act as 5-function sources for 0, creating 
the profile (3.68)? And, if so, is this profile then felt as a static potential? The answer 
is essentially yes, at least in the limit M 3> m. But the correct way to describe the 
potential felt by the ip particles is not to talk about classical fields at all, but instead 
work directly with the quantum amplitudes. 

Our strategy is to compare the nucleon scattering amplitude (3.52) to the corre- 
sponding amplitude in non-relativistic quantum mechanics for two particles interacting 
through a potential. To make this comparison, we should first take the non-relativistic 
limit of (3.52). Let's work in the center of mass frame, with p = p\ = —p 2 and 
p' = Pi = —P2- The non-relativistic limit means \p\ <C M which, by momentum 
conservation, ensures that \p'\ <C M. In fact one can check that, for this particular 
example, this limit doesn't change the scattering amplitude (3.52): it's given by 



iA = +ig 



1 

+ 



(p — p') 2 + m 2 (p + p 



1)2 _ |_ m 2 



(3.69) 



How do we compare this to scattering in quantum mechanics? Consider two particles, 
separated by a distance r, interacting through a potential U(r). In non-relativistic 
quantum mechanics, the amplitude for the particles to scatter from momentum states 
±p into momentum states ±p' can be computed in perturbation theory, using the 
techniques described in Section 3.1. To leading order, known in this context as the 
Born approximation, the amplitude is given by 

(p'\ U{f) \p) = -i f dhU^e^-^ (3.70) 



There's a relative factor of (2M) 2 that arises in comparing the quantum field theory 
amplitude A to (p'\ U (r) \p), that can be traced to the relativistic normalization of the 
states \pi,P2)- (It is also necessary to get the dimensions of the potential to work out 
correctly). Including this factor, and equating the expressions for the two amplitudes, 
we get 

d s r U{f) e -iV-f)* = (3 . 71) 

(p — p'Y + m 1 
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where we've introduced the dimensionless parameter A = g/2M. We can trivially invert 
this to find, 



But this is exactly the integral (3.66) we just did in the classical theory. We have 



This is the Yukawa potential. The force has a range 1/m, the Compton wavelength of 
the exchanged particle. The minus sign tells us that the potential is attractive. 

Notice that quantum field theory has given us an entirely new perspective on the 
nature of forces between particles. Rather than being a fundamental concept, the force 
arises from the virtual exchange of other particles, in this case the meson. In Section 6 
of these lectures, we will see how the Coulomb force arises from quantum field theory 
due to the exchange of virtual photons. 

We could repeat the calculation for nucleon-anti-nucleon scattering. The amplitude 
from field theory is given in (3.59). The first term in this expression gives the same 
result as for nucleon-nucleon scattering with the same sign. The second term vanishes in 
the non-relativisitic limit (it is an example of an interaction that doesn't have a simple 
Newtonian interpretation). There is no longer a factor of 1/2 in (3.70), because the 
incoming/outgoing particles are not identical, so we learn that the potential between 
a nucleon and anti-nucleon is again given by (3.73). This reveals a key feature of 
forces arising due to the exchange of scalars: they are universally attractive. Notice 
that this is different from forces due to the exchange of a spin 1 particle — such as 
electromagnetism — where the sign flips when we change the charge. However, for 
forces due to the exchange of a spin 2 particle — i.e. gravity — the force is again 
universally attractive. 

3.5.3 ^ 4 Theory 

Let's briefly look at the Feynman rules and scattering amplitudes for the interaction 
Hamiltonian 




(3.72) 




(3.73) 



A 

~ 4! 



i 



(3.74) 



The theory now has a single interaction vertex, which comes with a factor of (— iX), 
while the other Feynman rules remain the same. Note that we assign (— z'A) to the 



vertex rather than (— iA/4!). To see why this is, we can look at 00 — > 00 scattering, 
which has its lowest contribution at order A, with the term 

(p[,P2\ '■ (f)(x)(j)(x)(f)(x)(f)(x) : |pi,p 2 ) (3.75) 

Any one of the fields can do the job of annihilation or creation. This gives 4! different 
contractions, which cancels the 1/4! sitting out front. 

Feynman diagrams in the 4 theory sometimes come with extra 
combinatoric factors (typically 2 or 4) which are known as symmetry 
factors that one must take into account. For more details, see the book 
by Peskin and Schroeder. 

Using the Feynman rules, the scattering amplitude for 00 — > <f)<f) is Figure 16: 
simply iA = —iX. Note that it doesn't depend on the angle at which 
the outgoing particles emerge: in 4 theory the leading order two-particle scattering 
occurs with equal probability in all directions. Translating this into a potential between 
two mesons, we have 

So scattering in 4 theory is due to a 5-function potential. The particles don't know 
what hit them until it's over. 

3.5.4 Connected Diagrams and Amputated Diagrams 

We've seen how one can compute scattering amplitudes by writing down all Feynman 
diagrams and assigning integrals to them using the Feynman rules. In fact, there are 
a couple of caveats about what Feynman diagrams you should write down. Both of 
these caveats are related to the assumption we made earlier that "initial and final states 
are eigenstates of the free theory" which, as we mentioned at the time, is not strictly 
accurate. The two caveats which go some way towards ameliorating the problem are 
the following 

• We consider only connected Feynman diagrams, where every part of the diagram 
is connected to at least one external line. As we shall see shortly, this will be 
related to the fact that the vacuum |0) of the free theory is not the true vacuum 
\fl) of the interacting theory. An example of a diagram that is not connected is 
shown in Figure 17. 



\ / 

s ' 

A -ik 
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• We do not consider diagrams with loops on external lines, for example the diagram 
shown in the Figure 18. We will not explain how to take these into account in this 
course, but you will discuss them next term. They are related to the fact that the 
one-particle states of the free theory are not the same as the one-particle states of 
the interacting theory. In particular, correctly dealing with these diagrams will 
account for the fact that particles in interacting quantum field theories are never 
alone, but surrounded by a cloud of virtual particles. We will refer to diagrams 
in which all loops on external legs have been cut-off as "amputated" . 




Figure 17: A disconnected diagram. Figure 18: An un-amputated diagram 



3.6 What We Measure: Cross Sections and Decay Rates 

So far we've learnt to compute the quantum amplitudes for particles decaying or scat- 
tering. As usual in quantum theory, the probabilities for things to happen are the 
(modulus) square of the quantum amplitudes. In this section we will compute these 
probabilities, known as decay rates and cross sections. One small subtlety here is that 
the S-matrix elements (/| S — 1 \i) all come with a factor of (27r) 4 5^(j)F — Pi), so we 
end up with the square of a delta-function. As we will now see, this comes from the 
fact that we're working in an infinite space. 



3.6.1 Fermi's Golden Rule 

Let's start with something familiar and recall how to derive Fermi's golden rule from 
Dyson's formula. For two energy eigenstates \m) and \n), with E m ^ E n , we have to 
leading order in the interaction, 

(m\ U(t) \n) = -i (m\ [ dtH^t) \n) 

Jo 

t 

I „iu>t' 



—i (m\ H int \n) \ dt' e 
Jo 



-(m\H int \n) (3.77) 
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where uo = E m — E n . This gives us the probability for the transition from \n) to \m) in 
time t, as 



P n ^ m {t) = \(m\U (t) \n)\ 2 = 2| (m| fl int \n) \ ' 

The function in brackets is plotted in Figure 19 for fixed t. 
We see that in time t, most transitions happen in a region 
between energy eigenstates separated by AE = 2ir/t. As 
t —7- oo, the function in the figure starts to approach a delta- 
function. To find the normalization, we can calculate 



1 — cos out 



(3.78) 



+ 0O 

duo 

oo 

1 — COS uot 

U 2 



1 — cos out 



10 - 



7rt 




— > irt5(cu) as t — > oo 



Figure 19: 



Consider now a transition to a cluster of states with density 
p(E). In the limit t — >■ oo, we get the transition probability 



dE m p(E m ) 2| (m| if int |n) 



2tt| (m| flint |n> | 2 p(^n)t 



1 — cos cut 



(3.79) 



which gives a constant probability for the transition per unit time for states around 
the same energy E n ~ E m = E. 



P„ 



2tt| (m|fl int |n)| 2 p(^) 



(3.80) 



This is Fermi's Golden Rule. 



In the above derivation, we were fairly careful with taking the limit as t — > oo. 
Suppose we were a little sloppier, and first chose to compute the amplitude for the 
state |n) at t — >■ — oo to transition to the state \m) at t — > +oo. Then we get 

/•i=+oo 

-i (m\ / Hj(t) \n) = -i (m\ H int \n) 2tt5(u) (3.81) 

J t=—oo 

Now when squaring the amplitude to get the probability, we run into the problem of 
the square of the delta-function: P n ^ m — | (m\ H mt \n) | 2 (27r) 2 <5(ci;) 2 . Tracking through 
the previous computations, we realize that the extra infinity is coming because P m ^ n 
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is the probability for the transition to happen in infinite time t — » oo. We can write 
the delta-functions as 

{2n) 2 5{u) 2 = (2ir)6(u) T (3.82) 

where T is shorthand for t — > oo (we used a very similar trick when looking at the 
vacuum energy in (2.25)). We now divide out by this power of T to get the transition 
probability per unit time, 

P n ^ m = 2tx\ (m\ H int \n) | 2 5(u) (3.83) 

which, after integrating over the density of final states, gives us back Fermi's Golden 
rule. The reason that we've stressed this point is because, in our field theory calcula- 
tions, we've computed the amplitudes in the same way as (3.81), and the square of the 
(^-functions will just be re- interpreted as spacetime volume factors. 

3.6.2 Decay Rates 

Let's now look at the probability for a single particle \i) of momentum pj (I=initial) 
to decay into some number of particles |/) with momentum Pi and total momentum 
Pf = ^2iPi- This is given by 

pJ mvm (3 - 84) 

Our states obey the relativistic normalization formula (2.65), 

(z| i) = (2vr) 3 2E fl 5 {3) (0) = 2E fl V (3.85) 
where we have replaced 5^(0) by the volume of 3-space. Similarly, 

(f\f)= J] 2E A V (3.86) 

final states 

If we place our initial particle at rest, so pi — and = m, we get the probability 
for decay 

P J 4£^ {P ,- PF )VT n (3.87) 

final states 

where, as in the second derivation of Fermi's Golden Rule, we've exchanged one of the 
delta-functions for the volume of spacetime: (27r) 4 5^ 4 )(0) = VT. The amplitudes Aft 
are, of course, exactly what we've been computing. (For example, in (3.30), we saw 
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that A = —g for a single meson decaying into two nucleons). We can now divide out 
by T to get the transition function per unit time. But we still have to worry about 
summing over all final states. There are two steps: the first is to integrate over all 
possible momenta of the final particles: V J d 3 pi/(2ir) 3 . The factors of spatial volume 
V in this measure cancel those in (3.87), while the factors of l/2Ep. in (3.87) conspire 
to produce the Lorentz invariant measure for 3-momentum integrals. The result is an 
expression for the density of final states given by the Lorentz invariant measure 

final states " 

The second step is to sum over all final states with different numbers (and possibly 
types) of particles. This gives us our final expression for the decay probability per unit 
time, r = P. 

r=^ £ / \A fl \ 2 dIi (3.89) 

final states 

T is called the width of the particle. It is equal to the reciprocal of the half- life r = 1/T. 
3.6.3 Cross Sections 

Collide two beams of particles. Sometimes the particles will hit and bounce off each 
other; sometimes they will pass right through. The fraction of the time that they collide 
is called the cross section and is denoted by a. If the incoming flux F is defined to 
be the number of incoming particles per area per unit time, then the total number of 
scattering events N per unit time is given by, 

N = Fa (3.90) 

We would like to calculate a from quantum field theory. In fact, we can calculate a more 
sensitive quantity da known as the differential cross section which is the probability 
for a given scattering process to occur in the solid angle {0,4>). More precisely 

Differential Probability 1 1 2 

= Unit Time x Unit Flux = 4EiE 2 VF ( } 

where we've used the expression for probability per unit time that we computed in the 
previous subsection. E\ and Ei are the energies of the incoming particles. We now 
need an expression for the unit flux. For simplicity, let's sit in the center of mass frame 
of the collision. We've been considering just a single particle per spatial volume V, 
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meaning that the flux is given in terms of the 3- velocities Vi as F = \v\ — v<i\jV . This 
then gives, 



da 



1 



1 



— \A fi \ 2 dU 



(3.92) 



AE 1 E 2 \vx - V2 



If you want to write this in terms of momentum, then recall from your course on special 
relativity that the 3- velocities Vi are related to the momenta by v = p/my/l — v 2 = 



Equation (3.92) is our final expression relating the S-matrix to the differential cross 
section. You may now take your favorite scattering amplitude, and compute the proba- 
bility for particles to fly out at your favorite angles. This will involve doing the integral 
over the phase space of final states, with measure dU. Notice that different scattering 
amplitudes have different momentum dependence and will result in different angular 
dependence in scattering amplitudes. For example, in 4 theory the amplitude for tree 
level scattering was simply A = —A. This results in isotropic scattering. In contrast, 
for nucleon-nucleon scattering we have schematically A ~ (t — m 2 ) -1 + (u — m 2 )~ l . 
This gives rise to angular dependence in the differential cross-section, which follows 
from the fact that, for example, t = — 2\p\ 2 (l — cos6), where 9 is the angle between the 
incoming and outgoing particles. 

3.7 Green's Functions 

So far we've learnt to compute scattering amplitudes. These are nice and physical (well 
- they're directly related to cross-sections and decay rates which are physical) but there 
are many questions we want to ask in quantum field theory that aren't directly related 
to scattering experiments. For example, we might want to compute the viscosity of 
the quark gluon plasma, or the optical conductivity in a tentative model of strange 
metals, or figure out the non-Gaussianity of density perturbations arising in the CMB 
from novel models of inflation. All of these questions are answered in the framework of 
quantum field theory by computing elementary objects known as correlation functions. 
In this section we will briefly define correlation functions, explain how to compute them 
using Feynman diagrams, and then relate them back to scattering amplitudes. We'll 
leave the relationship to other physical phenomena to other courses. 

We'll denote the true vacuum of the interacting theory as \Q). We'll normalize H 
such that 



p/p°. 



H\Q) = 



(3.93) 
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and fl) = 1. Note that this is different from the state we've called |0) which is the 
vacuum of the free theory and satisfies H \0) = 0. Define 

G (n) (ii, ...,x n ) = {n\T<j> H ( Xl ) . . . ct> H {x n ) \n) (3.94) 

where <pH is <fi in the Heisenberg picture of the full theory, rather than the interaction 
picture that we've been dealing with so far. The are called correlation functions, 
or Green's functions. There are a number of different ways of looking at these objects 
which tie together nicely. Let's start by asking how to compute G^ using Feynman 
diagrams. We prove the following result 

Claim: We use the notation fa = fax±), and write fan to denote the field in the 
Heisenberg picture, and fai to denote the field in the interaction picture. Then 

GM( Xl , ...,x n ) = (n\Tfa H ... fa H |n> = (O|T01 ( ; |g ^ g|O) (3.95) 

where the operators on the right-hand side are evaluated on |0), the vacuum of the free 
theory. 

Proof: Take t± > t 2 > ■ ■ ■ > t n . Then we can drop the T and write the numera- 
tor of the right-hand side as 

(0| C/ 7 (+oo, h)fa! U (t U t 2 ) 02/ • • • fai U r (t n , -oo) |0> 

We'll use the factors of Ui(tk,tk+i) = Texp(— i ff k+1 Hj) to convert each of the 0/ into 
<Ph and we choose operators in the two pictures to be equal at some arbitrary time to- 
Then we can write 

(0| C/ 7 (+oo, t^faj U (ti, t 2 ) (j) 2I ... fat Uj(t n , -oo) |0> 

= (0| Ui(+oo, t ) fan ...fan £/j(t , -oo) |0) 

Now let's deal with the two remaining U (to, ±oo) at either end of the string of operators. 
Consider an arbitrary state and look at 

(tf | Uiit, -oo) |0) = (tf | U(t, -oo) |0) (3.96) 

where U(t, — oo) is the Schrodinger evolution operator, and the equality above follows 
because H |0) = 0. Now insert a complete set of states, which we take to be energy 
eigenstates of H = H + if int , 



(tf | U(t, -oo) |0) = U(t, -oo) 



|n> (n| + J>> (n\ 



|0> 



(*|fi)(fi|0)+ lim y^ e ^(*'-*) /^fl n \ / n | o> (3.97) 



-OO ' 
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But the last term vanishes. This follows from the Riemann-Lebesgue lemma which says 
that for any well-behaved function 



lim / dxf(x)e ilJX = (3.98) 

Why is this relevant? The point is that the Yl n ^ n (3-97) is really an integral J dn, 
because all states are part of a continuum due to the momentum. (There is a caveat 
here: we want the vacuum \Q) to be special, so that it sits on its own, away from the 
continuum of the integral. This means that we must be working in a theory with a 
mass gap - i.e. with no massless particles). So the Riemann-Lebesgue lemma gives us 

lim |0) = H) <n| 0) (3.99) 

t'— >— oo 

(Notice that to derive this result, Peskin and Schroeder instead send t — > — oo in a 
slightly imaginary direction, which also does the job). We now apply the formula 
(3.99), to the top and bottom of the right-hand side of (3.95) to find 

(0|fi)(fi|fi)(fi|0) 1 J 

which, using the normalization fi) = 1, gives us the left-hand side, completing the 
proof. □. 

3.7.1 Connected Diagrams and Vacuum Bubbles 

We're getting closer to our goal of computing the Green's functions G^ n ' since we 
can compute both (0| T<pj{x\) . . . (f>i(x n ) S |0) and (0| S |0) using the same methods 
we developed for S-matrix elements; namely Dyson's formula and Wick's theorem or, 
alternatively, Feynman diagrams. But what about dividing one by the other? What's 
that all about? In fact, it has a simple interpretation. For the following discussion, 
we will work in <j) A theory. Since there is no ambiguity in the different types of line 
in Feynman diagrams, we will represent the particles as solid lines, rather than the 
dashed lines that we used previously. Then we have the diagramatic expansion for 
(0|5|0>. 



<0|o i |0} = l + g+( O + (J) + g g )+-.. (3.101) 



These diagrams are called vacuum bubbles. The combinatoric factors (as well as the 
symmetry factors) associated with each diagram are such that the whole series sums 
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to an exponential, 



S\0) = exp ^ Q + O + (JJ) + ...^ (3.102) 



So the amplitude for the vacuum of the free theory to evolve into itself is (0| S |0) = 
exp (all distinct vacuum bubbles). A similar combinatoric simplification occurs for generic 
correlation functions. Remarkably, the vacuum diagrams all add up to give the same 
exponential. With a little thought one can show that 

(0\T<pi... <p n S |0) = connected diagrams) (0| S |0) (3.103) 

where "connected" means that every part of the diagram is connected to at least one 
of the external legs. The upshot of all this is that dividing by (0| S |0) has a very nice 
interpretation in terms of Feynman diagrams: we need only consider the connected 
Feynman diagrams, and don't have to worry about the vacuum bubbles. Combining 
this with (3.95), we learn that the Green's functions G^ n '( be calculated 

by summing over all connected Feynman diagrams, 

(Q\ T . . . 4>H{x n ) |^) = Connected Feynman Graphs (3.104) 

An Example: The Four-Point Correlator: T<p H (xi) . . . 0# (a^) \Vt) 

As a simple example, let's look at the four-point correlation function in <f) A theory. The 
sum of connected Feynman diagrams is given by, 

Xi X2 

Xi X 2 Xl X 2 

° ° + 2 Similar + JX^ + X + 5 Similar + 

x 3 ° °x 4 x 3 ^ ^x 4 x 3 jXX, x 4 



All of these are connected diagrams, even though they don't look 
that connected! The point is that a connected diagram is defined 
by the requirement that every line is joined to an external leg. An 
example of a diagram that is not connected is shown in the figure. 
As we have seen, such diagrams are taken care of in shifting the 
vacuum from |0) to \ 

Feynman Rules 

The Feynman diagrams that we need to calculate for the Green's functions depend on 
Xi, . . . ,x n . This is rather different than the Feynman diagrams that we calculated for 




Figure 20: 
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the S-matrix elements, where we were working primarily with momentum eigenstates, 
and ended up integrating over all of space. However, it's rather simple to adapt the 
Feynman rules that we had earlier in momentum space to compute G^ n \xi . . . ,x n ). 
For <f) A theory, we have 

• Draw n external points x±, . . . ,x n , connected by the usual propagators and ver- 
tices. Assign a spacetime position y to the end of each line. 

• For each line x y from xtoy write down a factor of the Feynman propagator 

A F (x - y). 

• For each vertex & t position y, write down a factor of — iX J d 4 y. 
3.7.2 From Green's Functions to S-Matrices 

Having described how to compute correlation functions using Feynman diagrams, let's 
now relate them back to the S-matrix elements that we already calculated. The first 
step is to perform the Fourier transform, 

G^(x u ...,x n ) (3.105) 

These are very closely related to the S-matrix elements that we've computed above. The 
difference is that the Feynman rules for G^ n \xi, . . . , x n ), effectively include propagators 
Ap for the external legs, as well as the internal legs. A related fact is that the 4- 
momenta assigned to the external legs is arbitrary: they are not on-shell. Both of these 
problems are easily remedied to allow us to return to the S-matrix elements: we need 
to simply cancel off the propagators on the external legs, and place their momentum 
back on shell. We have 

n' n 

. . . ,p'J S-l\ Pl ..., Pn ) = H)" + "' JIM 2 - m 2 ) - m 2 ) (3.106) 

i=\ j=i 

x & n+n '\-p' 1 ,...,-p' n „p 1 ,...,p n ) 

Each of the factors (p 2 —m 2 ) vanishes once the momenta are placed on-shell. This means 
that we only get a non-zero answer for diagrams contributing to G^ n '(xi, . . . , x n ) which 
have propagators for each external leg. You might think they all do, but it's not true! 
Only diagrams that are fully connected, meaning each external point is connected to 
each other external point, have this property. For example, of the diagrams that we 
wrote down which contribute to the four-point function {Q\ T0#(xi) . . . 4>h(x4) only 
will survive the multiplication by on-shell propagators in (3.106) to contribute to 
the S-matrix for meson scattering in 4 theory. 
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So what's the point of all of this? We've understood that ignoring the connected 
diagrams is related to shifting to the true vacuum But other than that, introducing 
the Green's functions seems like a lot of bother for little reward. The important point 
is that this provides a framework in which to deal with the true particle states in 
the interacting theory through renormalization. Indeed, the formula (3.106), suitably 
interpreted, remains true even in the interacting theory, taking into account the swarm 
of virtual particles surrounding asymptotic states. This is the correct way to consider 
scattering. In this context, (3.106) is known as the LSZ reduction formula. You will 
derive it properly next term. 
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4. The Dirac Equation 



"A great deal more was hidden in the Dirac equation than the author had 
expected when he wrote it down in 1928. Dirac himself remarked in one of 
his talks that his equation was more intelligent than its author. It should 
be added, however, that it was Dirac who found most of the additional 
insights. Weisskopf on Dirac 

So far we've only discussed scalar fields such that under a Lorentz transformation 
x^ 1 — > = A p u x u , the field transforms as 

<f>(x) ->• (f)'(x) = (j)(A~ l x) (4.1) 

We have seen that quantization of such fields gives rise to spin particles. But most 
particles in Nature have an intrinsic angular momentum, or spin. These arise naturally 
in field theory by considering fields which themselves transform non-trivially under the 
Lorentz group. In this section we will describe the Dirac equation, whose quantization 
gives rise to fermionic spin 1/2 particles. To motivate the Dirac equation, we will start 
by studying the appropriate representation of the Lorentz group. 

A familiar example of a field which transforms non-trivially under the Lorentz group 
is the vector field A^x) of electromagnetism, 

A"(x) -»• A^A^A^x) (4.2) 

We'll deal with this in Section 6. (It comes with its own problems!). In general, a field 
can transform as 

<P(x) -+ DiAF^A-ty (4.3) 

where the matrices D[A] form a representation of the Lorentz group, meaning that 

£>[Ai]D[A 2 ] = £>[AiA 2 ] (4.4) 

and .DA" 1 ] = DA] -1 and D[l] = 1. How do we find the different representations? 
Typically, we look at infinitesimal transformations of the Lorentz group and study the 
resulting Lie algebra. If we write, 

A"„ = *"„ + < (4.5) 



for infinitesimal u, then the condition for a Lorentz transformation A^ a A v p Vj ap = rj 
becomes the requirement that u is anti- symmetric: 

uj^ + u »i* = (4.6) 
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Note that an antisymmetric 4x4 matrix has 4x3/2 = 6 independent components, 
which agrees with the 6 transformations of the Lorentz group: 3 rotations and 3 boosts. 
It's going to be useful to introduce a basis of these six 4 x 4 anti-symmetric matrices. 
We could call them (M A ) pp ', with A = 1, . . . , 6. But in fact it's better for us (although 
initially a little confusing) to replace the single index A with a pair of antisymmetric 
indices [per], where p, a = 0, . . . , 3, so we call our matrices (M pa ) p v . The antisymmetry 
on the p and a indices means that, for example, M 01 = —M 10 , etc, so that p and a 
again label six different matrices. Of course, the matrices are also antisymmetric on 
the pv indices because they are, after all, antisymmetric matrices. With this notation 
in place, we can write a basis of six 4x4 antisymmetric matrices as 

{Mry = v m v au - ^ V (4.7) 

where the indices p and v are those of the 4x4 matrix, while p and a denote which 
basis element we're dealing with. If we use these matrices for anything practical (for 
example, if we want to multiply them together, or act on some field) we will typically 
need to lower one index, so we have 

{M pc Y v = rj pfl b~\ - rj aix 5 p „ (4.8) 

Since we lowered the index with the Minkowski metric, we pick up various minus signs 
which means that when written in this form, the matrices are no longer necessarily 
antisymmetric. Two examples of these basis matrices are, 



(M oi r, = i ;; : and (m^y u = i ; : ° i (4.9) 





The first, A^ 01 , generates boosts in the x l direction. It is real and symmetric. The 
second, Ai 12 , generates rotations in the (x 1 , x 2 )-plane. It is real and antisymmetric. 
We can now write any as a linear combination of the Ai pa , 

< = | fi pCT (ATX (4-10) 

where Q pcr are just six numbers (again antisymmetric in the indices) that tell us what 
Lorentz transformation we're doing. The six basis matrices M. pa are called the gen- 
erators of the Lorentz transformations. The generators obey the Lorentz Lie algebra 
relations, 

[M pa , M TV ] = r] aT M pv - T] pT M av + if - r] av M pT (4.11) 
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where we have suppressed the matrix indices. A finite Lorentz transformation can then 
be expressed as the exponential 



A = exp(±n pa M pa ) (4.12) 

Let me stress again what each of these objects are: the }A pa are six 4x4 basis elements 
of the Lorentz Lie algebra; the Q pa are six numbers telling us what kind of Lorentz 
transformation we're doing (for example, they say things like rotate by 9 = n/7 about 
the x 3 -direction and run at speed v = 0.2 in the x 1 direction). 

4.1 The Spinor Representation 

We're interested in finding other matrices which satisfy the Lorentz algebra commuta- 
tion relations (4.11). We will construct the spinor representation. To do this, we start 
by defining something which, at first sight, has nothing to do with the Lorentz group. 
It is the Clifford algebra, 

{ 7 ^, Y} = YY + YY = 2rTi (4.13) 

where 7 M , with \i = 0, 1, 2, 3, are a set of four matrices and the 1 on the right-hand side 
denotes the unit matrix. This means that we must find four matrices such that 

y 7 " = ~YY when /I ^ v (4.14) 

and 

( 7 °) 2 = 1 , (Y) 2 = -l » = 1,2,3 (4.15) 

It's not hard to convince yourself that there are no representations of the Clifford 
algebra using 2 x 2 or 3 x 3 matrices. The simplest representation of the Clifford 
algebra is in terms of 4 x 4 matrices. There are many such examples of 4 x 4 matrices 
which obey (4.13). For example, we may take 

where each element is itself a 2 x 2 matrix, with the a 1 the Pauli matrices 

-(::) • -Co 1 ) • <-> 

which themselves satisfy {a\a^} = 2<? J . 
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One can construct many other representations of the Clifford algebra by taking 
Vj^V -1 for any invertible matrix V. However, up to this equivalence, it turns out 
that there is a unique irreducible representation of the Clifford algebra. The matrices 
(4.16) provide one example, known as the Weyl or chiral representation (for reasons 
that will soon become clear). We will soon restrict ourselves further, and consider only 
representations of the Clifford algebra that are related to the chiral representation by 
a unitary transformation V. 

So what does the Clifford algebra have to do with the Lorentz group? Consider the 
commutator of two 7 M , 

S" = - [Y, 71 = { , ° p , 9 = .° ) = \YY - \r (4-18) 
4 { 2 7 P 7 P + ° J 1 

Let's see what properties these matrices have: 
Claim 4.1: [S^, Y] = Yv up ~ 7 V 
Proof: When \x ^ v we have 

= \yyy - \yyy 

= \y{i\y) - \yyy - \{y,y yr + §vw 

= Yv up - 7 V □ 

Claim 4.2: The matrices S^" form a representation of the Lorentz algebra (4.11), 
meaning 

[S p », S pa ] = 7] up S pCT - r) w S v ° + r] pa S up - fS" p (4.19) 
Proof: Taking p ^ a, and using Claim 4.1 above, we have 

[s^,s pa ] = \[s pv ,YY] 

= i[s p \Y]Y + lY[s p \r] 

= \YYv vp - \YYv pp + \YYv va - \YYv ap (4.20) 

Now using the expression (4.18) to write 7^7°" = 2S pcr + r] pcr , we have 

[S^, S pa ] = S pa r] vp - S va ri pp + S^rT - S pv r] ap (4.21) 
which is our desired expression. □ 
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4.1.1 Spinors 

The S pu are 4x4 matrices, because the 7 M are 4x4 matrices. So far we haven't given 
an index name to the rows and columns of these matrices: we're going to call them 
a,p= 1,2,3,4. 

We need a field for the matrices {S pu ) a p to act upon. We introduce the Dirac spinor 
field ip a (x), an object with four complex components labelled by a = 1,2,3,4. Under 
Lorentz transformations, we have 



where 



i) a {x) ->■ S[A] a ^ip^(A~ 1 x) 
S[A] = exp(lfi p(T ^) 



(4.22) 

(4.23) 
(4.24) 



Although the basis of generators M. pa and S pcr are different, we use the same six 
numbers Q pcr in both A and S[A\: this ensures that we're doing the same Lorentz 
transformation on x and ip. Note that we denote both the generator S pcr and the full 
Lorentz transformation S[A] as "S"'. To avoid confusion, the latter will always come 
with the square brackets [A]. 

Both A and S[A] are 4x4 matrices. So how can we be sure that the spinor repre- 
sentation is something new, and isn't equivalent to the familiar representation A^? To 
see that the two representations are truly different, let's look at some specific transfor- 
mations. 

Rotations 

*H(:^(:,:H-f:;) <-> 

If we write the rotation parameters as fiy = —€ijkip k (meaning Q12 = — y? 3 , etc) then 
the rotation matrix becomes 

( +10-8/2 q \ 
e^J (426) 

where we need to remember that f2i 2 = —^21 = — y? 3 when following factors of 2. 
Consider now a rotation by 2ir about, say, the x 3 -axis. This is achieved by tp = (0, 0, 2tc), 
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and the spinor rotation matrix becomes 



T 3 



Therefore under a 27r rotation 

ip a (x) -» -i; a (x) (4.28) 

which is definitely not what happens to a vector! To check that we haven't been 
cheating with factors of 2, let's see how a vector would transform under a rotation by 
0= (0,0,<^ 3 ). We have 







A = exp (| Sl pa M»°) = exp ° ° * J (4.29) 







So when we rotate a vector by ip 3 = 2n, we learn that A = 1 as you would expect. So 
S[A] is definitely a different representation from the familiar vector representation A^. 




- .'.I M /.)=U-f°A (4.30, 



Boosts 



-a 1 J * \ a 1 
Writing the boost parameter as f2j = — = Xh we have 

S|A) =( e-H (4 ' 31) 
Representations of the Lorentz Group are not Unitary 

Note that for rotations given in (4.26), S[A] is unitary, satisfying S^Aj^A] = 1. But 
for boosts given in (4.31), S[A] is not unitary. In fact, there are no finite dimensional 
unitary representations of the Lorentz group. We have demonstrated this explicitly for 
the spinor representation using the chiral representation (4.16) of the Clifford algebra. 
We can get a feel for why it is true for a spinor representation constructed from any 
representation of the Clifford algebra. Recall that 

5[A]=exp(|^ CT ^) (4.32) 

so the representation is unitary if are anti-hermitian, i.e. (S^Y = —S^ u . But we 
have 

{Sn ] = -\[W\{l v ) ] ] (4-33) 
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which can be anti-hermitian if all 7^ are hermitian or all are anti-hermitian. However, 
we can never arrange for this to happen since 



(7 ) 2 = 1 =>- Real Eigenvalues 
(7*) 2 = — 1 =>- Imaginary Eigenvalues 



(4.34) 



So we could pick 7 to be hermitian, but we can only pick 7* to be anti-hermitian. 
Indeed, in the chiral representation (4.16), the matrices have this property: (7°)^ = 7 
and (7*) ' = —7*. In general there is no way to pick 7^ such that are anti-hermitian. 

4.2 Constructing an Action 

We now have a new field to work with, the Dirac spinor ip. We would like to construct 
a Lorentz invariant equation of motion. We do this by constructing a Lorentz invariant 
action. 

We will start in a naive way which won't work, but will give us a clue how to proceed. 
Define 



which is the usual adjoint of a multi-component object. We could then try to form a 
Lorentz scalar by taking the product ^ r> with the spinor indices summed over. Let's 
see how this transforms under Lorentz transformations, 



So ip^(x)ijj(x) — > ijy (A~ 1 x)S[A] Jf S[A}ip(A~ 1 x). But, as we have seen, for some Lorentz 
transformation 5 [A] '5 [A] ^ 1 since the representation is not unitary. This means that 
ifjty isn't going to do it for us: it doesn't have any nice transformation under the 
Lorentz group, and certainly isn't a scalar. But now we see why it fails, we can also see 
how to proceed. Let's pick a representation of the Clifford algebra which, like the chiral 
representation (4.16), satisfies (7°)^ = 7 and (7*)"'" = —7*. Then for all fi = 0,1,2,3 
we have 



(4.35) 



ip(x) ->■ S[A]-0(A~V) 
^(x) -)• ^ t (A" 1 x),S[A] t 



(4.36) 



tVt = (ry 



(4.37) 



which, in turn, means that 




1 







(4.38) 



1 
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so that 

5[A]t = exp (± n^OS^t) = 7 ° S[A]" Y (4.39) 

With this in mind, we now define the Dirac adjoint 

${x) = ^ t (x) 7 ° (4.40) 

Let's now see what Lorentz covariant objects we can form out of a Dirac spinor tp and 
its adjoint ■0. 

Claim 4.3: tpip is a Lorentz scalar. 
Proof: Under a Lorentz transformation, 

-)• ^(A^rr) S^V StA^A -1 *) 

= 0(A" 1 x)^(A" 1 x) (4.41) 

which is indeed the transformation law for a Lorentz scalar. □ 

Claim 4.4: ifj^^ip is a Lorentz vector, which means that 

^(x) 7 M V(z) A^^(A^x) Y ^(A"V) (4.42) 

This equation means that we can treat the /i — 0, 1,2,3 index on the 7 M matrices as 

a true vector index. In particular we can form Lorentz scalars by contracting it with 
other Lorentz indices. 

Proof: Suppressing the x argument, under a Lorentz transformation we have, 

07^ ^S[A]"V S[A]i/> (4.43) 

If ijj 7^ ip is to transform as a vector, we must have 

S^A] -1 7^ S[A\ — A^j-f (AAA) 

We'll now show this. We work infinitesimally, so that 

A = exp (I n pc M pa ) « 1 + \n pa M p<T + ... (4.45) 

5[A] = exp (| « 1 + 1^5^ + . . . (4.46) 
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so the requirement (4.44) becomes 



-[S pa ,Y] = (M pa y u Y (4.47) 

where we've suppressed the a, (3 indices on 7^ and S^, but otherwise left all other 
indices explicit. In fact equation (4.47) follows from Claim 4.1 where we showed that 
[5 ,po ",7 M ] = 7^7/°"^ — y^w. To see this, we write the right-hand side of (4.47) by 
expanding out Ai, 

(Mnw = - ir w 

= ri PP Y - rfV (4.48) 

which means that the proof follows if we can show 

- [S^, Y] = V P ^l a - V ap Y (4-49) 

which is exactly what we proved in Claim 4.1. □ 

Claim 4.5: %p r j Pir ) u %l) transforms as a Lorentz tensor. More precisely, the symmet- 
ric part is a Lorentz scalar, proportional to rf v ipip, while the antisymmetric part is a 
Lorentz tensor, proportional to tpS^ip. 

Proof: As above. □ 

We are now armed with three bilinears of the Dirac field, ipip, ipj^ip and ^7^7^-^, 
each of which transforms covariantly under the Lorentz group. We can try to build a 
Lorentz invariant action from these. In fact, we need only the first two. We choose 

S = J d 4 x ${x) {i^d^ - m) if>(x) (4.50) 

This is the Dirac action. The factor of "i" is there to make the action real; upon complex 
conjugation, it cancels a minus sign that comes from integration by parts. (Said another 
way, it's there for the same reason that the Hermitian momentum operator — zV in 
quantum mechanics has a factor i). As we will see in the next section, after quantization 
this theory describes particles and anti-particles of mass \m\ and spin 1/2. Notice that 
the Lagrangian is first order, rather than the second order Lagrangians we were working 
with for scalar fields. Also, the mass appears in the Lagrangian as m, which can be 
positive or negative. 
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4.3 The Dirac Equation 

The equation of motion follows from the action (4.50) by varying with respect to ip and 
i[) independently. Varying with respect to ip, we have 

(ry^ - m) %j) = (4.51) 

This is the Dirac equation. It's completely gorgeous. Varying with respect to ip gives 
the conjugate equation 

id^tp 7^ + rrnp = (4.52) 

The Dirac equation is first order in derivatives, yet miraculously Lorentz invariant. If 
we tried to write down a first order equation of motion for a scalar field, it would look 
like v^d^ip = . . ., which necessarily includes a privileged vector in spacetime v M and is 
not Lorentz invariant. However, for spinor fields, the magic of the 7 M matrices means 
that the Dirac Lagrangian is Lorentz invariant. 

The Dirac equation mixes up different components of ip through the matrices 7^. 
However, each individual component itself solves the Klein-Gordon equation. To see 
this, write 

{yfd v + m){i'fd ll - m) V = - {'f'fd^ + m 2 ) i> = (4.53) 

But 7 A V0 / A = |{7 M :T 1 '}^^ = so we S et 

-(d^ + m 2 )^ = (4.54) 

where this last equation has no 7^ matrices, and so applies to each component ip a , with 
a = 1,2,3,4. 

The Slash 

Let's introduce some useful notation. We will often come across 4- vectors contracted 
with 7 M matrices. We write 

A^Y = 4 (4.55) 



so the Dirac equation reads 

(i0 — m)ip = 



(4.56) 



4.4 Chiral Spinors 

When we've needed an explicit form of the 7^ matrices, we've used the chiral represen- 
tation 

In this representation, the spinor rotation transformation S^A^] and boost transfor- 
mation .SjAboost] were computed in (4.26) and (4.31). Both are block diagonal, 

/ e +i<p-ff/2 q \ f e +x-a/2 q \ 

S[Kot] = and ,S[A boost ] = _ (4.58) 

\ e+^Z 2 J \ e~ x - a / 2 ) 

This means that the Dirac spinor representation of the Lorentz group is reducible. It 
decomposes into two irreducible representations, acting only on two-component spinors 
u± which, in the chiral representation, are defined by 

The two-component objects u± are called Weyl spinors or chiral spinors. They trans- 
form in the same way under rotations, 

but oppositely under boosts, 

u± ->■ e ± * s/2 u± (4.61) 

In group theory language, u + is in the (|,0) representation of the Lorentz group, 
while U- is in the (0, |) representation. The Dirac spinor ifj lies in the (|,0) © (0, |) 
representation. (Strictly speaking, the spinor is a representation of the double cover of 
the Lorentz group SL(2, C)). 

4.4.1 The Weyl Equation 

Let's see what becomes of the Dirac Lagrangian under the decomposition (4.59) into 
Weyl spinors. We have 

C = ip{i — m)ip = iuLa^d^U- + iu\_a^d^u + — m(u' + U- + v)_u + ) = (4.62) 
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where we have introduced some new notation for the Pauli matrices with a /z = 0, 1, 2, 3 
index, 

<7" = (1,<7*) and ^ = (1,-^) (4.63) 

From (4.62), we see that a massive fermion requires both u + and w_, since they couple 
through the mass term. However, a massless fermion can be described by u + (or u_) 
alone, with the equation of motion 

ia' 1 d^u + = 

or ia^d^u_ = (4.64) 

These are the Weyl equations. 
Degrees of Freedom 

Let me comment here on the degrees of freedom in a spinor. The Dirac fermion has 
4 complex components = 8 real components. How do we count degrees of freedom? 
In classical mechanics, the number of degrees of freedom of a system is equal to the 
dimension of the configuration space or, equivalently, half the dimension of the phase 
space. In field theory we have an infinite number of degrees of freedom, but it makes 
sense to count the number of degrees of freedom per spatial point: this should at least 
be finite. For example, in this sense a real scalar field <fi has a single degree of freedom. 
At the quantum level, this translates to the fact that it gives rise to a single type of 
particle. A classical complex scalar field has two degrees of freedom, corresponding to 
the particle and the anti-particle in the quantum theory. 

But what about a Dirac spinor? One might think that there are 8 degrees of freedom. 
But this isn't right. Crucially, and in contrast to the scalar field, the equation of motion 
is first order rather than second order. In particular, for the Dirac Lagrangian, the 
momentum conjugate to the spinor ip is given by 

vr^ = dC/dip = (4.65) 

It is not proportional to the time derivative of ip. This means that the phase space for 
a spinor is therefore parameterized by ip and while for a scalar it is parameterized 
by <f) and it = <fi. So the phase space of the Dirac spinor ip has 8 real dimensions and 
correspondingly the number of real degrees of freedom is 4. We will see in the next 
section that, in the quantum theory, this counting manifests itself as two degrees of 
freedom (spin up and down) for the particle, and a further two for the anti-particle. 

A similar counting for the Weyl fermion tells us that it has two degrees of freedom. 
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4.4.2 7 5 

The Lorentz group matrices S[A] came out to be block diagonal in (4.58) because we 
chose the specific representation (4.57). In fact, this is why the representation (4.57) 
is called the chiral representation: it's because the decomposition of the Dirac spinor 
ip is simply given by (4.59). But what happens if we choose a different representation 
7 M of the Clifford algebra, so that 

y _> WfU' 1 and -> Uip ? (4.66) 

Now S[A] will not be block diagonal. Is there an invariant way to define chiral spinors? 
We can do this by introducing the "fifth" gamma-matrix 

7 5 = -2 7 y 7 y (4.67) 

You can check that this matrix satisfies 

{ 7 5 ,y} = and ( 7 5 ) 2 = +l (4.68) 

The reason that this is called 7 5 is because the set of matrices 7 A = ( 7 M ,z 7 5 ), with 
A = 0,1,2,3,4 satisfy the five-dimensional Clifford algebra {^f A ,'j B } = 2r] AB . (You 
might think that 7 4 would be a better name! But 7 5 is the one everyone chooses - it's 
a more sensible name in Euclidean space, where A = 1,2,3,4,5). You can also check 
that [5' (UI/ , 7 5 ] = 0, which means that 7 5 is a scalar under rotations and boosts. Since 
( 7 5 ) 2 = 1, this means we may form the Lorentz invariant projection operators 

P ± = ^(l± 7 5 ) (4.69) 

such that P? = P + and P 2 = P_ and P+P- = 0. One can check that for the chiral 
representation (4.57), 

Hi) 

from which we see that the operators P± project onto the Weyl spinors u±. However, 
for an arbitrary representation of the Clifford algebra, we may use 7 5 to define the 
chiral spinors, 

V>± = P±^ (4-71) 

which form the irreducible representations of the Lorentz group. ip + is often called a 
"right-handed" spinor, while ^_ is "left-handed". 
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4.4.3 Parity 

The spinors ip± are related to each other by parity. Let's pause to define this concept. 
The Lorentz group is defined by x^ — > t^ v x u such that 

A^A p a V ua = rf p (4.72) 

So far we have only considered transformations A which are continuously connected to 
the identity; these are the ones which have an infinitesimal form. However there are 
also two discrete symmetries which are part of the Lorentz group. They are 

Time Reversal T : x° — > —x° ; x % — > x % 

Parity P : x° ->• x° ; x l -> -x l (4.73) 

We won't discuss time reversal too much in this course. (It turns out to be represented 
by an anti-unitary transformation on states. See, for example the book by Peskin and 
Schroeder). But parity has an important role to play in the standard model and, in 
particular, the theory of the weak interaction. 

Under parity, the left and right-handed spinors are exchanged. This follows from the 
transformation of the spinors under the Lorentz group. In the chiral representation, we 
saw that the rotation (4.60) and boost (4.61) transformations for the Weyl spinors u± 
are 

u ± ^ e^ 3 ' 2 u ± and u ± ^ e ^ 3 / 2 u ± (4.74) 

Under parity, rotations don't change sign. But boosts do flip sign. This confirms that 
parity exchanges right-handed and left-handed spinors, P : u± — >• u T , or in the notation 
tp± = |(1 ± 7 5 )^, we have 

P:iP±(x,t)^i> T (-x,t) (4.75) 

Using this knowledge of how chiral spinors transform, and the fact that P 2 = 1, we see 
that the action of parity on the Dirac spinor itself can be written as 

P : ip{x,t) 7 ( V(-f,t) (4.76) 

Notice that if ip(x,t) satisfies the Dirac equation, then the parity transformed spinor 
^y i(j(—x,t) also satisfies the Dirac equation, meaning 

(z7°<9t + ij% - m) 7 V(-x, t) = 7°(z7°<9 t - ifdi - m)^(-x, t) = (4.77) 

where the extra minus sign from passing 7 through 7* is compensated by the derivative 
acting on — x instead of +x. 
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4.4.4 Chiral Interactions 

Let's now look at how our interaction terms change under parity. We can look at each 
of our spinor bilinears from which we built the action, 

P : 4>ip(x,t) ->■ $ij)(-x,t) (4.78) 

which is the transformation of a scalar. For the vector ■07 / V> we can l°°k & t the 
temporal and spatial components separately, 

P : ^7>(f,t) ->■ ^°ijj(-x,t) 

P : $ 7 ty(£,t) -> ^7°7 < 7°^(-f,i) = -^ 7 V(-x,t) (4.79) 

which tells us that ^^ip transforms as a vector, with the spatial part changing sign. 
You can also check that ^S^vp transforms as a suitable tensor. 

However, now we've discovered the existence of 7 5 , we can form another Lorentz 
scalar and another Lorentz vector, 

^7 5 ^ and (4.80) 

How do these transform under parity? We can check: 

P : ^7 5 ^(f,t) -> ^7°7 5 7° ^(-£,f) = -iJn 5 ^(-x,t) (4.81) 

P : ^7 5 7"^(f,i) ->■ ^7° 7 5 7' i 7V(-^*) = 



-^ 7 5 7 X-f,i) /i = 
+'07 5 7*'0(— x, t) /i = i 



which means that ip^ip transforms as a pseudoscalar, while ifj / -f 5/ j fJ- ilj transforms as an 
axia/ vector. To summarize, we have the following spinor bilinears, 





scalar 


■07^ 


vector 




tensor 




pseudoscalar 


•07 5 7 M *0 


axial vector 



(4.82) 

The total number of bilinears is 1 + 4 + (4 x 3/2) + 4 + 1 = 16 which is all we could 
hope for from a 4-component object. 



- 95 - 



We're now armed with new terms involving 7 s that we can start to add to our 
Lagrangian to construct new theories. Typically such terms will break parity invariance 
of the theory, although this is not always true. (For example, the term cpip^ip doesn't 
break parity if <fr is itself a pseudoscalar). Nature makes use of these parity violating 
interactions by using 7 s in the weak force. A theory which treats ip± on an equal 
footing is called a vector-like theory. A theory in which %p + and ip_ appear differently 
is called a chiral theory. 

4.5 Major ana Fermions 

Our spinor ip a is a complex object. It has to be because the representation S[A] 
is typically also complex. This means that if we were to try to make ip real, for 
example by imposing ip = tp*, then it wouldn't stay that way once we make a Lorentz 
transformation. However, there is a way to impose a reality condition on the Dirac 
spinor ip. To motivate this possibility, it's simplest to look at a novel basis for the 
Clifford algebra, known as the Majorana basis. 

These matrices satisfy the Clifford algebra. What is special about them is that they 
are all pure imaginary (7^)* = — 7 M . This means that the generators of the Lorentz 
group S^ u = |[7 M , 7"], and hence the matrices S[A] are real. So with this basis of the 
Clifford algebra, we can work with a real spinor simply by imposing the condition, 

ip = tp* (4.83) 

which is preserved under Lorentz transformation. Such spinors are called Majorana 
spinors. 

So what's the story if we use a general basis for the Clifford algebra? We'll ask only 
that the basis satisfies (7°)^ = 7 and (7*)' = —7*. We then define the charge conjugate 
of a Dirac spinor ip as 

t// c ) = Cip* (4.84) 

Here C is a 4 x 4 matrix satisfying 

C ] C = 1 and CVC = -(7T ( 4 -85) 

Let's firstly check that (4.84) is a good definition, meaning that ip^ transforms nicely 
under a Lorentz transformation. We have 



i) {c) -»• CS[A]*ip* = S[A]Cip* = S[A]^ 



(4.86) 



where we've made use of the properties (4.85) in taking the matrix C through S[A]*. 
In fact, not only does ip^ transform nicely under the Lorentz group, but if ip satisfies 
the Dirac equation, then ip^ does too. This follows from, 

(i0-m)tp = O (-i 0* - m)ip* = 

=> C(-i 0* - m)i/j* = (+i$- m)ip {c) = 

Finally, we can now impose the Lorentz invariant reality condition on the Dirac spinor, 
to yield a Major ana spinor, 

V> (c) = t/j (4.87) 

After quantization, the Majorana spinor gives rise to a fermion that is its own anti- 
particle. This is exactly the same as in the case of scalar fields, where we've seen that 
a real scalar field gives rise to a spin boson that is its own anti-particle. (Be aware: 
In many texts an extra factor of 7 is absorbed into the definition of C). 

So what is this matrix C? Well, for a given representation of the Clifford algebra, it 
is something that we can find fairly easily. In the Majorana basis, where the gamma 
matrices are pure imaginary, we have simply CMaj = 1 and the Majorana condition 
ip = ip^ becomes ip = ip*. In the chiral basis (4.16), only 7 2 is imaginary, and we 
may take C c hirai = i>l 2 = ^ _° a 2 " j • (The matrix ia 2 that appears here is simply the 
anti-symmetric matrix e a/3 ). It is interesting to see how the Majorana condition (4.87) 
looks in terms of the decomposition into left and right handed Weyl spinors (4.59). 
Plugging in the various definitions, we find that u + = ia 2 u*_ and w_ = —i<j 2 u* + . In 
other words, a Majorana spinor can be written in terms of Weyl spinors as 

V> = ( U+ 2 ) (4.88) 
V —ia u\ J 

Notice that it's not possible to impose the Majorana condition ip = tp( c > at the same 
time as the Weyl condition (w_ = or u + = 0). Instead the Majorana condition relates 
u_ and u + . 

An Aside: Spinors in Different Dimensions: The ability to impose Majorana 
or Weyl conditions on Dirac spinors depends on both the dimension and the signature of 
spacetime. One can always impose the Weyl condition on a spinor in even dimensional 
Minkowski space, basically because you can always build a suitable "7 s " projection 
matrix by multiplying together all the other 7-matrices. The pattern for when the 
Majorana condition can be imposed is a little more sporadic. Interestingly, although 
the Majorana condition and Weyl condition cannot be imposed simultaneously in four 
dimensions, you can do this in Minowski spacetimes of dimension 2, 10, 18, . . .. 
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4.6 Symmetries and Conserved Currents 

The Dirac Lagrangian enjoys a number of symmetries. Here we list them and compute 
the associated conserved currents. 

Spacetime Translations 

Under spacetime translations the spinor transforms as 

5ip = e^d^ifj (4.89) 

The Lagrangian depends on d p ip, but not d^ip, so the standard formula (1.41) gives us 
the energy-momentum tensor 

T pv = i^d v i) - rj^C (4.90) 

Since a current is conserved only when the equations of motion are obeyed, we don't lose 
anything by imposing the equations of motion already on T^ v . In the case of a scalar 
field this didn't really buy us anything because the equations of motion are second 
order in derivatives, while the energy-momentum is typically first order. However, for 
a spinor field the equations of motion are first order: {i$ — m)ip = 0. This means we 
can set £ = in T pu , leaving 

= i^d v i) (4.91) 

In particular, we have the total energy 

B -J** T °° = / d ' x im = / * VHva - + m » (492) 

where, in the last equality, we have again used the equations of motion. 
Lorentz Transformations 

Under an infinitesimal Lorentz transformation, the Dirac spinor transforms as (4.22) 
which, in infinitesimal form, reads 

<ty« = -uj^x v d^ a + \Q pcT {S pcJ ) a ^ (4.93) 

where, following (4.10), we have u p u = \VL pa (M pa Y v , and M pa are the generators of 
the Lorentz algebra given by (4.8) 

(M pa Y v = T) m 61 - 7)^81 (4.94) 
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which, after direct substitution, tells us that uj pv = Vt pv '. So we get 

6ip a = -uT [x„d,r - \{S, V T^] (4.95) 

The conserved current arising from Lorentz transformations now follows from the same 
calculation we saw for the scalar field (1.54) with two differences: firstly, as we saw 
above, the spinor equations of motion set C = 0; secondly, we pick up an extra piece 
in the current from the second term in (4.95). We have 

(J»y = X °T PP - x p T^ - iipj^S^ip (4.96) 

After quantization, when (J p ) pa is turned into an operator, this extra term will be 
responsible for providing the single particle states with internal angular momentum, 
telling us that the quantization of a Dirac spinor gives rise to a particle carrying spin 
1/2. 

Internal Vector Symmetry 

The Dirac Lagrangian is invariant under rotating the phase of the spinor, ip — > e~ ia ip. 
This gives rise to the current 

j v = ^ (4.97) 

where "V" stands for vector, reflecting the fact that the left and right-handed compo- 
nents ip± transform in the same way under this symmetry. We can easily check that 
jy is conserved under the equations of motion, 

d^jy = (d^)Y^ + ^Y{df,ip) = irmp^j - irmfnf) = (4.98) 

where, in the last equality, we have used the equations of motion i0ip = mip and 
id ll ip^f p = —mip. The conserved quantity arising from this symmetry is 

Q = j ***** = (4.99) 

We will see shortly that this has the interpretation of electric charge, or particle number, 
for fermions. 

Axial Symmetry 

When m — 0, the Dirac Lagrangian admits an extra internal symmetry which rotates 
left and right-handed fermions in opposite directions, 

tu^e ia ^'i/j and $ -> (4.100) 
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Here the second transformation follows from the first after noting that e %a ^^° = 
7°e +iaT> . This gives the conserved current, 

f A = Vt'VY (4.101) 

where A is for "axial" since j A is an axial vector. This is conserved only when m = 0. 
Indeed, with the full Dirac Lagrangian we may compute 

d„f A = (d^-f^ip + ^7 M 7 5 <9/^ = 2*m^ 7 5 </> (4.102) 

which vanishes only for m = 0. However, in the quantum theory things become more 
interesting for the axial current. When the theory is coupled to gauge fields (in a 
manner we will discuss in Section 6), the axial transformation remains a symmetry 
of the classical Lagrangian. But it doesn't survive the quantization process. It is the 
archetypal example of an anomaly: a symmetry of the classical theory that is not 
preserved in the quantum theory. 

4.7 Plane Wave Solutions 

Let's now study the solutions to the Dirac equation 

(i^d^ - m)ip = (4.103) 

We start by making a simple ansatz: 

if, = u (p) e~ ip - x (4.104) 

where u(p) is a four- component spinor, independent of spacetime x which, as the no- 
tation suggests, can depend on the 3-momentum p. The Dirac equation then becomes 

— TYl T) \ 

^ \ u(p) = (4.105) 
p^ -m J 

where we're again using the definition, 

<7 M = (1,<7*) and ^ = (1,-0-*) (4.106) 
Claim: The solution to (4.105) is 

«®=(^!f) (4-107) 
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for any 2-component spinor £ which we will normalize to £t£ = 1. 



Proof: Let's write u{p) T = (141,1*2)- Then equation (4.105) reads 

(p ■ a) u 2 = mui and (p ■ a)ui = mu 2 (4.108) 

Either one of these equations implies the other, a fact which follows from the identity 
(jP " a )(p " = Po ~ ViVj a%(J ^ — Po ~ PiPjd^ — PfiP^ 1 — m2 - To start with, let's try 
the ansatz U\ = (p • cr)£' for some spinor Then the second equation in (4.108) 
immediately tells us that u 2 = So we learn that any spinor of the form 

u{p) = A[ {P ' (T) J ) (4.109) 

with constant A is a solution to (4.105). To make this more symmetric, we choose 
A = 1/m and £' = y/p ■ a £ with constant £. Then iti = (p • cr)y/p ■ a £ = m-y/p • cx£. So 
we get the promised result (4.107) □ 

Negative Frequency Solutions 

We get further solutions to the Dirac equation from the ansatz 

ijj = v (f) e +ip - x (4.110) 

Solutions of the form (4.104), which oscillate in time as ip ~ e~ tEt , are called positive 
frequency solutions. Those of the form (4.110), which oscillate as ip ~ e +lEt , are 
negative frequency solutions. It's important to note however that both are solutions to 
the classical field equations and both have positive energy (4.92). The Dirac equation 
requires that the 4-component spinor v{p) satisfies 

(y*p M + m)v(p) = ( m P ^ a )v(p) = (4.111) 
\ Pfi®^ rn J 

which is solved by 

V(P1 J^Zl>) (4.112) 
for some 2-component spinor rj which we take to be constant and normalized to rfr) = 1. 
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4.7.1 Some Examples 

Consider the positive frequency solution with mass m and 3-momentum p = 0, 



u(p) = y/m[ (4.113) 




where £ is any 2-component spinor. Spatial rotations of the field act on £ by (4.26), 

£ -> e +i ^ l2 i (4.114) 

The 2-component spinor £ defines the spin of the field. This should be familiar from 
quantum mechanics. A field with spin up (down) along a given direction is described 
by the eigenvector of the corresponding Pauli matrix with eigenvalue +1 (-1 respec- 
tively). For example, £ T = (1,0) describes a field with spin up along the z-axis. After 
quantization, this will become the spin of the associated particle. In the rest of this 
section, we'll indulge in an abuse of terminology and refer to the classical solutions to 
the Dirac equations as "particles" , even though they have no such interpretation before 
quantization. 

Consider now boosting the particle with spin £ T = (1, 0) along the x 3 direction, with 
p^ = (E, 0, 0,p). The solution to the Dirac equation becomes 



u{p)=\ , )l \ = \ , A< (4.115) 





In fact, this expression also makes sense for a massless field, for which E = p 3 . (We 
picked the normalization (4.107) for the solutions so that this would be the case). For 
a massless particle we have 

«(P1 = W;J (4.116) 
Similarly, for a boosted solution of the spin down £ T = (0, 1) field, we have 
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Helicity 

The helicity operator is the projection of the angular momentum along the direction 
of momentum, 

where S" iJ is the rotation generator given in (4.25). The massless field with spin £ T = 
(1,0) in (4.116) has helicity h = 1/2: we say that it is right-handed. Meanwhile, the 
field (4.117) has helicity h = —1/2: it is left-handed. 

4.7.2 Some Useful Formulae: Inner and Outer Products 

There are a number of identities that will be very useful in the following section, 
regarding the inner (and outer) products of the spinors u(p) and v(p). It's firstly 
convenient to introduce a basis £ s and r] s , s = 1, 2 for the two-component spinors such 
that 

grtg, = grs and r} r\ r} s = S r. (4.119) 

for example, 

e= (M and e= r j (4.120) 

and similarly for rj s . Let's deal first with the positive frequency plane waves. The two 
independent solutions are now written as 

We can take the inner product of four-component spinors in two different ways: either 
as iv -u, or asw-u. Of course, only the latter will be Lorentz invariant, but it turns out 
that the former is needed when we come to quantize the theory. Here we state both: 



= r y <t? +c ] p- h s = 2r W = ^ rs (4.122) 

while the Lorentz invariant inner product is 



u r (p) ■ u 



\p) = (r+v^rVp^ r ( ^Zm =^ rs (4.123) 
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We have analogous results for the negative frequency solutions, which we may write as 

V '(p) = ( ) With ^ ' VS{ ® = 2po5TS (4.124) 

\ — y/p -orf J and v r (p) ■ v s (p) = — 2m5 rs 

We can also compute the inner product between u and v. We have 



u r (p) ■ v s (p) = (f t y/p^, C^VP 7 ^) 7° 



Vp^v 8 
-Vp^v 3 

= e*y/(p-*)(p-°)v' ~ eW(p-*)(p-<r)v' = (4.125) 



and similarly, v r (p) ■ u s (p) = 0. However, when we come to tv ■ v, it is a slightly 
different combination that has nice properties (and this same combination appears 
when we quantize the theory). We look at u r \p) ■ v s (—p), with the 3-momentum in 
the spinor v taking the opposite sign. Defining the 4-momentum {p'Y = (p°, —p), we 
have 



u r Hp) ■ v s (-p) = {^Vp 7 ^, ^Vp 7 *) 



\Jp' • o rf 



—y/p 1 ■ orf 

= eW(p-vW-v)v s - eW(p-*w-°)v 8 (4.126) 

Now the terms under the square-root are given by {p-cr){p' -a) = (po+Pi&' l )(po+Pi<7 t ) = 
Po + P 2 - The same expression holds for (p ■ a)(p' ■ a), and the two terms cancel. We 
learn 

u r \p) ■ v s {-p) = v r \p) ■ u s {-p) = (4.127) 

Outer Products 

There's one last spinor identity that we need before we turn to the quantum theory. It is: 
Claim: 

2 

u s {p)u s (p) = 4+m (4.128) 

s=l 

where the two spinors are not now contracted, but instead placed back to back to give 
a 4 x 4 matrix. Also, 

2 

Y v s {p)v s {p) = jf-m (4.129) 

s=l 
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Proof: 

2 



u s (p)u s (p) = { yJ¥ —'l ) (e st Vp 7 *, e j vp^) (4.130) 



But Xls^'C' 5 ^ = 1) the 2x2 unit matrix, which then gives 



us 



£ = [ m V '° ) (4.131) 

s =i \p-a m J 

which is the desired result. A similar proof works for J2 s v s (p)v s (p). □ 
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5. Quantizing the Dirac Field 



We would now like to quantize the Dirac Lagrangian, 

£ = tjj(x) (i — m) if)(x) (5.1) 

We will proceed naively and treat if> as we did the scalar field. But we'll see that things 
go wrong and we will have to reconsider how to quantize this theory. 

5.1 A Glimpse at the Spin-Statistics Theorem 

We start in the usual way and define the momentum, 

dC 

7T = = irjrf = # f (5.2) 
dif> 

For the Dirac Lagrangian, the momentum conjugate to if) is iif)^ . It does not involve 
the time derivative of if). This is as it should be for an equation of motion that is first 
order in time, rather than second order. This is because we need only specify if> and 
ijy on an initial time slice to determine the full evolution. 

To quantize the theory, we promote the field if) and its momentum if)^ to operators, 
satisfying the canonical commutation relations, which read 

[M^)^lm = ^S^(x-y) (5.3) 
It's this step that we'll soon have to reconsider. 

Since we're dealing with a free theory, where any classical solution is a sum of plane 
waves, we may write the quantum operators as 

= E/ (0s ^ [^n%p)e + ^ + cpv s (p)e-^] 
= E / (03 ^ [ b f ti'^te"^ + ctv s ^e + * s ] (5.4) 

where the operators bp create particles associated to the spinors u s (p), while cp create 
particles associated to v s (p). As with the scalars, the commutation relations of the 
fields imply commutation relations for the annihilation and creation operators 
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Claim: The field commutation relations (5.3) are equivalent to 

[c^c^ = -(2n) 3 5 rs 5^(p-q) (5.5) 

with all other commutators vanishing. Note the strange minus sign in the [c, c'] term. 
It's not yet obvious that it's bad, but we should be aware of it. For now, let's just carry 
on. 

Proof: Let's show that the [b, b'] and [c, c'] commutators reproduce the field com- 
mutators (5.3), 



V ' 1 

73 



= E/ (^2^ (u'(^«'(^7°e* ( ^+t;'(^«'®7 o e-* ( ^ i0 X 5 - 6 ) 

At this stage we use the outer product formulae (4.128) and (4.129) which tell us 

J2 S u s (p)u s (p) = p 1 + m and J2 S v s (p)v s (p) = p 1 — m, so that 

mx),^m = J ^_L((y +m ) 7 o e *(*-s) + (y_ m ) 7 o e -*(^)) 

((po7° + PiV + ™h° + (Pol° - Pif - m) T °) e + ^^ 



(2tt) 3 2Ep 

where, in the second term, we've changed p —> —p under the integration sign. Now, 
using p = Ep we have 

Mx),^(y)] = j *L.-H*0HO = 5 (3) (f _ ft (5.7) 

as promised. Notice that it's a little tricky in the middle there, making sure that 
the piY terms cancel. This was the reason we needed the minus sign in the [c, cr] 
commutator terms in (5.5). □ 

5.1.1 The Hamiltonian 

To proceed, let's construct the Hamiltonian for the theory. Using the momentum 
7r = we have 

U = ni) - C = 4>(-ifdi + m)i) (5.8) 
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which means that H = Jd 3 xH agrees with the conserved energy computed using 
Noether's theorem (4.92). We now wish to turn the Hamiltonian into an operator. 
Let's firstly look at 



d 3 p 



(2tt)3 v /2E^ 



b s A-YPi + m)u s (p) e +l ^ + cJ(f Pi + m)v s (p) 



where, for once we've left the sum over s — 1,2 implicit. There's a small subtlety with 
the minus signs in deriving this equation that arises from the use of the Minkowski 
metric in contracting indices, so that p • x = x % p l = —x l pi. Now we use the defining 
equations for the spinors u s {p) and v s {p) given in (4.105) and (4.111), to replace 



(~l l Pi + m)w s 
so we can write 



j°Pou s (p) and (YPi + m)v s (p) = — r y [) pov s (p) (5.9) 



d 3 P [Ep o 
(2vr) 3 V 2 7 



btu s (p) e +l ^ - c s J v s (p) e~^ s (5.10) 



We now use this to write the operator Hamiltonian 



H 



d z xil)^°{—i^di + m)ijj 

b r ^u r {q)^e-^ S + c}v r (q) j e +i ^ 



d 3 xd 3 pd 3 q / Ep 



(2tt) 6 
d 3 p 1 



4£„- 



btu s {p)e +l P- s - c s J v s (p) ] e-^ 



(2tt) 3 2 



^ t K(pT-^(p1] 



-b r Jc%[u r (p)l ■ v s (-p)] + c^[v r (p)^ ■ u s (-p)] 



where, in the last two terms we have relabelled p — > —p. We now use our inner product 
formulae (4.122), (4.124) and (4.127) which read 



w r (p^ • u s {p) = V 



r ^t . v s ( 



2p 5 rs and w r (p) t • v s {-p) = v r (p)^ ■ u s {-p) = 



giving us 



H 



(5.11) 
(5.12) 
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The term is familiar and easily dealt with by normal ordering. However the — c^c 
term is a disaster! The Hamiltonian is not bounded below, meaning that our quantum 
theory makes no sense. Taken seriously it would tell us that we could tumble to states 
of lower and lower energy by continually producing c* particles. As the English would 
say, it's all gone a bit Pete Tong. (No relation). 

Since the above calculation was a little tricky, you might think that it's possible to 
rescue the theory to get the minus signs to work out right. You can play around with 
different things, but you'll always find this minus sign cropping up somewhere. And, 
in fact, it's telling us something important that we missed. 

5.2 Fermionic Quantization 

The key piece of physics that we missed is that spin 1/2 particles are fermions, meaning 
that they obey Fermi-Dirac statistics with the quantum state picking up a minus sign 
upon the interchange of any two particles. This fact is embedded into the structure 
of relativistic quantum field theory: the spin-statistics theorem says that integer spin 
fields must be quantized as bosons, while half-integer spin fields must be quantized 
as fermions. Any attempt to do otherwise will lead to an inconsistency, such as the 
unbounded Hamiltonian we saw in (5.12). 

So how do we go about quantizing a field as a fermion? Recall that when we quantized 
the scalar field, the resulting particles obeyed bosonic statistics because the creation 
and annihilation operators satisfied the commutation relations, 



To have states obeying fermionic statistics, we need ant i- commutation relations, {A, B} = 
AB + BA. Rather than (5.3), we will ask that the spinor fields satisfy 



We still have the expansion (5.4) of ip and ip* in terms of b,b\c and c'. But now the 
same proof that led us to (5.5) tells us that 



(5.13) 



(5.14) 



{b^b^} = (2n) 3 5 rs 6^(p-q) 
= (2tt) 3 i 8"5®(j?-$ 



(5.15) 



with all other anti- commutators vanishing, 



o 



(5.16) 
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The calculation of the Hamiltonian proceeds as before, all the way through to the 
penultimate line (5.11). At that stage, we get 



(2 



7T 



3 



h s ^h s - rlr s] 
°P °p C P°p 



d 3 p 



(2VT) 



3 ^P 



bp%+cpcl-(2ir)W\0)] (5.17) 



The anti-commutators have saved us from the indignity of an unbounded Hamiltonian. 
Note that when normal ordering the Hamiltonian we now throw away a negative contri- 
bution — (2tt) 3 S^ 3 \0). In principle, this could partially cancel the positive contribution 
from bosonic fields. Cosmological constant problem anyone?! 

5.2.1 Fermi-Dirac Statistics 

Just as in the bosonic case, we define the vacuum |0) to satisfy, 

W = 4|0> = (5.18) 

Although b and c obey anti-commutation relations, the Hamiltonian (5.17) has nice 
commutation relations with them. You can check that 

[H, b% = -Efb r P and [H, b r J] = E p b r J 

[H, c r p ] = -E p c r p and [H, c r J] = (5.19) 

This means that we can again construct a tower of energy eigenstates by acting on the 
by and cy 

For example, we have the one-particle states 



vacuum by bV and c p to create particles and antiparticles, just as in the bosonic case. 



|p,r) = 6^|0) (5.20) 

The two particle states now satisfy 

\pi,ri,P2,r 2 ) = b Pl j b r yJ |0) = - \p2,r 2 ;pi,r 1 ) (5.21) 

confirming that the particles do indeed obey Fermi-Dirac statistics. In particular, we 
have the Pauli-Exclusion principle \p, r; p, r) = 0. Finally, if we wanted to be sure 
about the spin of the particle, we could act with the angular momentum operator 
(4.96) to confirm that a stationary particle \p = 0, r) does indeed carry intrinsic angular 
momentum 1/2 as expected. 

5.3 Dirac's Hole Interpretation 

"In this attempt, the success seems to have been on the side of Dirac rather 
than logic" p au \ % on Dirac 
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Let's pause our discussion to make a small historical detour. Dirac originally viewed 
his equation as a relativistic version of the Schrodinger equation, with ip interpreted 
as the wavefunction for a single particle with spin. To reinforce this interpretation, he 
wrote (i — m)ijj = as 

dib 

i-^- = -id -W + m(3i; = Hip (5.22) 
at 

where a = — 7°7 and = 7 . Here the operator H is interpreted as the one-particle 
Hamiltonian. This is a very different viewpoint from the one we now have, where if) 
is a classical field that should be quantized. In Dirac's view, the Hamiltonian of the 
system is H defined above, while for us the Hamiltonian is the field operator (5.17). 
Let's see where Dirac's viewpoint leads. 

With the interpretation of ip as a single-particle wavefunction, the plane-wave solu- 
tions (4.104) and (4.110) to the Dirac equation are thought of as energy eigenstates, 
with 

$ = v (p) e +ip - x = -Epijj (5.23) 

which look like positive and negative energy solutions. The spectrum is once again 
unbounded below; there are states v(p) with arbitrarily low energy —Ep. At first 
glance this is disastrous, just like the unbounded field theory Hamiltonian (5.12). Dirac 
postulated an ingenious solution to this problem: since the electrons are fermions (a 
fact which is put in by hand to Dirac's theory) they obey the Pauli-exclusion principle. 
So we could simply stipulate that in the true vacuum of the universe, all the negative 
energy states are filled. Only the positive energy states are accessible. These filled 
negative energy states are referred to as the Dirac sea. Although you might worry about 
the infinite negative charge of the vacuum, Dirac argued that only charge differences 
would be observable (a trick reminiscent of the normal ordering prescription we used 
for field operators). 

Having avoided disaster by floating on an infinite sea comprised of occupied negative 
energy states, Dirac realized that his theory made a shocking prediction. Suppose that 
a negative energy state is excited to a positive energy state, leaving behind a hole. 
The hole would have all the properties of the electron, except it would carry positive 
charge. After flirting with the idea that it may be the proton, Dirac finally concluded 
that the hole is a new particle: the positron. Moreover, when a positron comes across 
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an electron, the two can annihilate. Dirac had predicted anti-matter, one of the greatest 
achievements of theoretical physics. It took only a couple of years before the positron 
was discovered experimentally in 1932. 

Although Dirac's physical insight led him to the right answer, we now understand 
that the interpretation of the Dirac spinor as a single-particle wavefunction is not really 
correct. For example, Dirac's argument for anti-matter relies crucially on the particles 
being fermions while, as we have seen already in this course, anti-particles exist for 
both fermions and bosons. What we really learn from Dirac's analysis is that there is 
no consistent way to interpret the Dirac equation as describing a single particle. It is 
instead to be thought of as a classical field which has only positive energy solutions 
because the Hamiltonian (4.92) is positive definite. Quantization of this field then gives 
rise to both particle and anti-particle excitations. 

This from Julian Schwinger: 

"Until now, everyone thought that the Dirac equation referred directly to 
physical particles. Now, in field theory, we recognize that the equations 
refer to a sublevel. Experimentally we are concerned with particles, yet the 
old equations describe fields.... When you begin with field equations, you 
operate on a level where the particles are not there from the start. It is 
when you solve the field equations that you see the emergence of particles. " 

5.4 Propagators 

Let's now move to the Heisenberg picture. We define the spinors ip(x, t) at every point 
in spacetime such that they satisfy the operator equation 

We solve this by the expansion 

d 3 p 1 



dip 
~dt 



(5.24) 



ip{x) 
tp'(x) 



2 

E 

s=l 
2 

E 

s=l 



(2tt)3 y /2E p 

d 3 p 1 
(2tt)3 y /2E p L 



btu s (p)e- ip - x + c s Jv s (p)e +ip - x 



bJu 



(5.25) 



Let's now look at the anti-commutators of these fields. We define the fermionic prop- 
agator to be 



iS„# = {ip a (x),ipp(y)} 



(5.26) 
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In what follows we will often drop the indices and simply write iS(x—y) = {ip{x),ip(y)}, 
but you should remember that S(x — y) is a 4x4 matrix. Inserting the expansion (5.25), 
we have 

d 3 p 



[u s (p)u s (p)e- ip < x - y) + v s (p)v s (p)e +ip - {x ~ y) ] 



(2tt) 3 2E P 

J + m)e- ip - {x ~ y) + {4 - m)e +iHx ~ y) ] (5.27) 



(2tt) 3 2E P 

where to reach the final line we have used the outer product formulae (4.128) and 
(4.129). We can then write 

iS(x -y) = {i$ x + m)(D(x - y) - D(y - x)) (5.28) 

in terms of the propagator for a real scalar field D(x — y) which, recall, can be written 
as (2.90) 

^-^/IMs 8 """' (5 ' 29) 

Some comments: 

• For spacelike separated points (x — y) 2 < 0, we have already seen that D(x — y) — 
D(y — x) = 0. In the bosonic theory, we made a big deal of this since it ensured 
that 

[ct>{x)A{y)\ =0 (x-y) 2 <0 (5.30) 

outside the lightcone, which we trumpeted as proof that our theory was causal. 
However, for fermions we now have 

{Mz),Mv)} = (x-y) 2 <0 (5.31) 

outside the lightcone. What happened to our precious causality? The best that 
we can say is that all our observables are bilinear in fermions, for example the 
Hamiltonian (5.17). These still commute outside the lightcone. The theory re- 
mains causal as long as fermionic operators are not observable. If you think this is 
a little weak, remember that no one has ever seen a physical measuring apparatus 
come back to minus itself when you rotate by 360 degrees! 



- 113 - 



• At least away from singularities, the propagator satisfies 



(i0 x -m)S(x-y) = O (5.32) 

which follows from the fact that (0 X + m 2 )D(x — y) = using the mass shell 
condition p 2 = m 2 . 

5.5 The Feynman Propagator 

By a similar calculation to that above, we can determine the vacuum expectation value, 

(o\Mx)Mv)\o) = J ^w^ +mUe ~ w ' ix ~ v) 

(o\Mv)M*) |o> = / 0j-3^- m ^ e+ip<x ~ y) ( 5 - 33 ) 

We now define the Feynman propagator Sf{x — y), which is again a 4 x 4 matrix, as 
the time ordered product, 

[ (0| - if)(y)il>(x) |0) y° > x° 

Notice the minus sign! It is necessary for Lorentz invariance. When (x—y) 2 < 0, there is 
no invariant way to determine whether x° > y° or y° > x°. In this case the minus sign is 
necessary to make the two definitions agree since {tp(x), ^(y)} = outside the lightcone. 
We have the 4-momentum integral representation for the Feynman propagator, 

J (2vr) 4 p z — m z + xe 

which satisfies {i$ x — i7i)Sf(x — y) — i5^(x — y), so that Sf is a Green's function for 
the Dirac operator. 

The minus sign that we see in (5.34) also occurs for any string of operators inside 
a time ordered product T(. . .). While bosonic operators commute inside T, fermionic 
operators anti-commute. We have this same behaviour for normal ordered products as 
well, with fermionic operators obeying : ipiip2 '■= — '■ '■■ With the understanding 

that all fermionic operators anti-commute inside T and ::, Wick's theorem proceeds 
just as in the bosonic case. We define the contraction 

ip(x)ip{y) = T(ip(x)%p{y)) - : ip(x)ip{y) : = S F {x - y) (5.36) 
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5.6 Yukawa Theory 

The interaction between a Dirac fermion of mass m and a real scalar field of mass fi is 
governed by the Yukawa theory, 

C = Id^d^ - |/i 2 2 + Qii'fdp - m)ip - A#V> (5.37) 

which is the proper version of the baby scalar Yukawa theory we looked at in Section 3. 
Couplings of this type appear in the standard model, between fermions and the Higgs 
boson. In that context, the fermions can be leptons (such as the electron) or quarks. 

Yukawa originally proposed an interaction of this type as an effective theory of nuclear 
forces. With an eye to this, we will again refer to the particles as mesons, and the 
ip particles as nucleons. Except, this time, the nucleons have spin. (This is still not 
a particularly realistic theory of nucleon interactions, not least because we're omitting 
isospin. Moreover, in Nature the relevant mesons are pions which are pseudoscalars, so 
a coupling of the form (pijj^ifj would be more appropriate. We'll turn to this briefly in 
Section 5.7.3). 

Note the dimensions of the various fields. We still have [(f)] = 1, but the kinetic 
terms require that [ip] = 3/2. Thus, unlike in the case with only scalars, the coupling 
is dimensionless: [A] = 0. 

We'll proceed as we did in Section 3, firstly computing the amplitude of a particular 
scattering process then, with that calculation as a guide, writing down the Feynman 
rules for the theory. We start with: 

5.6.1 An Example: Putting Spin on Nucleon Scattering 

Let's study ipip — > x/jx/j scattering. This is the same calculation we performed in Section 
(3.3.3) except now the fermions have spin. Our initial and final states are 

\i) = y^EpE^b^ |0> = \p : s;q,r) 

\f) = v/4Ep^ bp bp |0> = \p', s'; q', r') (5.38) 

We need to be a little cautious about minus signs, because the &''s now anti-commute. 
In particular, we should be careful when we take the adjoint. We have 

(/| = V^BpV (0|&f# (5.39) 
We want to calculate the order A 2 terms from the S-matrix element (/| S — 1 

J d A Xl d 4 x 2 T (^(xO^(xi)0(zi) ^(x 2 Mx 2 )0(x 2 )) (5.40) 
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where, as usual, all fields are in the interaction picture. Just as in the bosonic calcula- 
tion, the contribution to nucleon scattering comes from the contraction 



: ip(xi)if)(x 1 )ij;(x 2 )'4)(x 2 ) : <p{xi)(j)(x 2 ) (5.41) 

We just have to be careful about how the spinor indices are contracted. Let's start 
by looking at how the fermionic operators act on |z). We expand out the ip fields, 
leaving the ip fields alone for now. We may ignore the c 1 pieces in ip since they give no 
contribution at order A 2 . We have 

f d 3 k d^k 

: feM* 2 ) : bp bp |0) = - / 1 2 $( Xl ) ■ u m (h)] ${x a ) ■ u n (k 2 )} 

iki-xi—ik2-X2 

b?blW\0) (5-42) 



e — = b?W b s Jb r J 



where we've used square brackets [•] to show how the spinor indices are contracted. The 
minus sign that sits out front came from moving ip{ x i) P as t ^{x^. Now anti-commuting 
the 6's past the fet's, we get 



1 



([ip(xi) ■ u r (q)] [ip(x 2 ) ■ u s 



s (tt\\ e - l v-x-2-n-x\ 



2 E^Eq 

- $( Xl ) ■ u s (p)] ${x 2 ) ■ u r (q)}e-^-^) |0) (5.43) 

Note, in particular, the relative minus sign that appears between these two terms. Now 
let's see what happens when we hit this with (/|. We look at 



(0| % % ${ Xl ) ■ u r (q)] $(x 2 ) ■ U s {p)\ |0) = \ [u s \p') ■ U r (q)) [u r \q') ■ u s (p)] 



+ip'-X2+iq'-xi 

U r '(q')-U r (q)] [u s '{p')-u l 



2y Ej?Eq. 

The [^(^i) • u s (p)} [^j{x 2 ) ■ u r (q)} term in (5.43) doubles up with this, cancelling the 
factor of 1/2 in front of (5.40). Meanwhile, the l/\f~E terms cancel the relativistic 
state normalization. Putting everything together, we have the following expression for 

U\s-i\i) 

- [u s '(p') ■ u r (q)] [u r \q') ■ u '(^)] e «i-(p'-ff)-M* a -(«'-p) > 
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where we've put the <fi propagator back in. Performing the integrals over x\ and X2, 
this becomes, 

l f [u s' {f) . ^ ^ } . _ + fc)5 (4) (p , _ _ fc) 

7 — /i + «e V 

- [fZ s '(p") • « r (^)] K'(<f) • u'(p)]£ (4) (p' -q + k)5 {4 \q' -p-kj) 

And we're almost there! Finally, writing the S-matrix element in terms of the amplitude 
in the usual way, (f\S — 1 \i) = iA(27r) 4 8^(p + q — p' — q'), we have 



A=(-i\) 



u s '(p') ■ u s (p)] [u r '(q') ■ u r (q)} [u s '(p') ■ u r (q)] \u r ' (q 1 ) ■ u* 



(p 1 — p) 2 — /i 2 + ie (q' — p) 2 — /i 2 + ie 

which is our final answer for the amplitude. 

5.7 Feynman Rules for Fermions 

It's important to bear in mind that the calculation we just did kind of blows. Thankfully 
the Feynman rules will once again encapsulate the combinatoric complexities and make 
life easier for us. The rules to compute amplitudes are the following 

• To each incoming fermion with momentum p and spin r, we associate a spinor 
u r {p). For outgoing fermions we associate u r {p). 



p > /i -- 3 \ P > 

°"'^> < > — «» 



Figure 21: An incoming fermion Figure 22: An outgoing fermion 

• To each incoming anti-fermion with momentum p and spin r, we associate a spinor 
v r {p). For outgoing anti-fermions we associate v r {p). 





np l < P ^ yep) 



Figure 23: An incoming anti-fermion Figure 24: An outgoing anti-fermion 

• Each vertex gets a factor of —iX. 
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• Each internal line gets a factor of the relevant propagator. 



p 



for scalars 



p 



> 



P 



> 2 — fi 2 + ie 



for fermions 



(5.44) 



P 



i 2 — m 2 + ie 



The arrows on the fermion lines must flow consistently through the diagram (this 
ensures fermion number conservation). Note that the fermionic propagator is a 
4x4 matrix. The matrix indices are contracted at each vertex, either with further 
propagators, or with external spinors u, u, v or v. 

• Impose momentum conservation at each vertex, and integrate over undetermined 
loop momenta. 

• Add extra minus signs for statistics. Some examples will be given below. 
5.7.1 Examples 

Let's run through the same examples we did for the scalar Yukawa theory. Firstly, we 
have 

Nucleon Scattering 

For the example we worked out previously, the two lowest order Feynman diagrams are 
shown in Figure 25. We've drawn the second Feynman diagram with the legs crossed 



to emphasize the fact that it picks up a minus sign due to statistics. (Note that the 
way the legs point in the Feynman diagram doesn't tell us the direction in which the 
particles leave the scattering event: the momentum label does that. The two diagrams 
above are different because the incoming legs are attached to different outgoing legs). 
Using the Feynman rules we can read off the amplitude. 




Figure 25: The two Feynman diagrams for nucleon scattering 



A={-i\) 2 (t 



[u s '(p') ■ u s (p)} [u r '(q') ■ u r (q)} \u s \p') ■ u r (q)} \u r ' (q 1 ) ■ u s (p)] 
(j) — p') 2 — jj? (p — q') 2 — jj? 



(5.45) 
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The denominators in each term are due to the meson propagator, with the momen- 
tum determined by conservation at each vertex. This agrees with the amplitude we 
computed earlier using Wick's theorem. 

Nucleon to Meson Scattering 

Let's now look at tpip — y <fxp. The two lowest order Feynman diagrams are shown in 
Figure 26. 




Figure 26: The two Feynman diagrams for nucleon to meson scattering 



Applying the Feynman rules, we have 

a i -\\2 fv r (q)b'*(p l * -P'a) +m]u s (p) v r (q)[^(p^- q')+m]u s 



(p — p') 2 — m 2 (p — q') 2 — m 2 

Since the internal line is now a fermion, the propagator contains J^ipn — p(J +m factors. 
This is a 4 x 4 matrix which sits on the top, sandwiched between the two external 
spinors. Now the exchange statistics applies to the final meson states. These are 
bosons and, correspondingly, there is no relative minus sign between the two diagrams. 

Nucleon-Anti-Nucleon Scattering 

For ipip — y ifjip, the two lowest order Feynman diagrams are of two distinct types, just 
like in the bosonic case. They are shown in Figure 27. The corresponding amplitude 
is given by, 

A={ ~ lX) { (p-jy) 2 -, 2 + (p + gy-^ + ie J (5 - 46) 

As in the bosonic diagrams, there is again the difference in the momentum dependence 
in the denominator. But now the difference in the diagrams is also reflected in the 
spinor contractions in the numerator. 
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Figure 27: The two Feynman diagrams for nucleon-anti-nucleon scattering 

More subtle are the minus signs. The fermionic statistics mean that the first diagram 
has an extra minus sign relative to the ipip scattering of Figure 25. Since this minus sign 
will be important when we come to figure out whether the Yukawa force is attractive 
or repulsive, let's go back to basics and see where it comes from. The initial and final 
states for this scattering process are 

\i) = y/AE^bpc^ |0> = \p,s;q,r) 

\f) = v/4Ep^ bp cp |0> ee ]?, s'; q', r>) (5.47) 

The ordering of and c* in these states is crucial and reflects the scattering i/jip — > ipij}, 
as opposed to tptp — > tptp which would differ by a minus sign. The first diagram in 
Figure 27 comes from the term in the perturbative expansion, 

(f\ : $(z 2 )i/>(x 2 ) : bp |0> ~ (f\ [v m (h) ■ VOrr)] $(z 2 ) ■ u B (^)]c^6|6J t 4 t |0> 

where we've neglected a bunch of objects in this equation like J d A ki and exponential 
factors because we only want to keep track of the minus signs. Moving the annihilation 
operators past the creation operators, we have 

+ (/I W\q) ■ i>(xi)\ mx 2 ) ■ u s (p)} |0> (5.48) 

Repeating the process by expanding out the i}j{ x i) an d tp(x2) fields and moving them 
to the left to annihilate (/|, we have 

(0| 44cfb^ [v r (q) ■ v m (h)} [u n (l 2 ) ■ u s m |0) ~ ~[v r (<f) ■ v'tf)] [u s ' ' (?) ■ u\p)} 

where the minus sign has appeared from anti-commuting c™^ past b s L. This is the 
overall minus sign found in (5.46). One can also follow similar contractions to compute 
the second diagram in Figure 27. 

Meson Scattering 
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Finally, we can also compute the scattering of 00 — > <fxj> 
which, as in the bosonic case, picks up its leading contribution 
at one- loop. The amplitude for the diagram shown in the figure 
is 



A 



V 



%A 



i-ixy 



d A k 



Tr 



{[ + m 



(2vr) 4 (k 2 - m 2 + it) ((k + p\) 2 - m 2 + it) 



x 



H + A - A + 



in 



—< — 

Figure 28: 

+ m 



((k + p{ — pi) 2 — m 2 + ie) ((k — p'^) 2 — m 2 + ie) 



Notice that the high momentum limit of the integral is J d 4 k/k 4 , which is no longer 
finite, but diverges logarithmically. You will have to wait until next term to make sense 
of this integral. 



There's an overall minus sign sitting in front of this amplitude. This is a generic 
feature of diagrams with fermions running in loops: each fermionic loop in a diagram 
gives rise to an extra minus sign. We can see this rather simply in the diagram 



which involves the expression 



$ a (x)$ a (x)$p(y)-il)p(y) = -if)p{y)i/; a (x) ijj a {x)ijjp{y) 

= -Tr (S F (y - x)S F (x - y)) 



After passing the fermion fields through each other, a minus sign appears, sitting in 
front of the two propagators. 



5.7.2 The Yukawa Potential Revisited 

We saw in Section 3.5.2, that the exchange of a real scalar particle gives rise to a 
universally attractive Yukawa potential between two spin zero particles. Does the 
same hold for the spin 1/2 particles? 

Recall that the strategy to compute the potential is to take the non-relativistic limit 
of the scattering amplitude, and compare with the analogous result from quantum 
mechanics. Our new amplitude now also includes the spinor degrees of freedom u(p) 
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and v(p). In the non-relativistic limit, p — > (m,p), and 

u(p) = 




v(p)= [ v " _ (5.49) 



In this limit, the spinor contractions in the amplitude for ipip — > ipip scattering (5.45) 
become u s ' ■ u s = 2m5 ss ' and the amplitude is 



Xs sXr r Xs rXr s 

-i(-iX) 2 (2m) ( — — - — — ) (5.50) 



The S symbols tell us that spin is conserved in the non-relativistic limit, while the 
momentum dependence is the same as in the bosonic case, telling us that once again 
the particles feel an attractive Yukawa potential, 

Mr) = -^4- (5-51) 

Repeating the calculation for ipip — > ifjip, there are two minus signs which cancel each 
other. The first is the extra overall minus sign in the scattering amplitude (5.46), 
due to the fermionic nature of the particles. The second minus sign comes from the 
non-relativistic limit of the spinor contraction for anti-particles in (5.46), which is 
v s ' ■ v s = —2m5 ss '. These two signs cancel, giving us once again an attractive Yukawa 
potential (5.51). 

5.7.3 Pseudo-Scalar Coupling 

Rather than the standard Yukawa coupling, we could instead consider 

£ Yuk = -A0^ 7 5 ^ (5.52) 
This still preserves parity if is a pseudoscalar, i.e. 

P : (f)(x,t) -<f>(-x,t) (5.53) 

We can compute in this theory very simply: the Feynman rule for the interaction vertex 
is now changed to a factor of — iX 1 -/ 5 . For example, the Feynman diagrams for ipip — > ipip 
scattering are again given by Figure 25, with the amplitude now 

2 / [M s '(p / )7 5 M' s (p)] [u r ' (q')^u T (q)] [u s '(p')^u r (q)} [u r ' (q')^ 5 u s (p)}\ 



A=(-i\) 



(p — p') 2 — p? (p — q') 2 — fi 



2 
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We could again try to take the non-relativistic limit for this amplitude. But this 
time, things work a little differently. Using the expressions for the spinors (5.49), we 
have u s ' ^u s — > in the non-relativistic limit. To find the non-relativistic amplitude, 
we must go to next to leading order. One can easily check that u s ' {p')^u s {p) — > 
m^ s ' T (p — p') ■ a^ s . So, in the non-relativistic limit, the leading order amplitude arising 
from pseudoscalar exchange is given by a spin-spin coupling, 



— > +im(—i\) 2 



r T (p-p')-^ s ]r T (p-p')-^ 
(p-p') 2 + /i 2 



(5.54) 
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6. Quantum Electrodynamics 

In this section we finally get to quantum electrodynamics (QED), the theory of light 
interacting with charged matter. Our path to quantization will be as before: we start 
with the free theory of the electromagnetic field and see how the quantum theory gives 
rise to a photon with two polarization states. We then describe how to couple the 
photon to fermions and to bosons. 

6.1 Maxwell's Equations 

The Lagrangian for Maxwell's equations in the absence of any sources is simply 

C = - l -F„ v F^ (6.1) 
where the field strength is defined by 

= d„A v - d v A„ (6.2) 
The equations of motion which follow from this Lagrangian are 

^sfe)--^- (6 - 3) 

Meanwhile, from the definition of F^ u , the field strength also satisfies the Bianchi 
identity 

dxF^ + d fl F u \ + d v F x „ = (6.4) 

To make contact with the form of Maxwell's equations you learn about in high school, 
we need some 3- vector notation. If we define = (0, A), then the electric field E and 
magnetic field B are defined by 

dA 

E = -V0 - — and B = V x A (6.5) 
which, in terms of F^, becomes 



E x E y E z 
-E z —By B x 



The Bianchi identity (6.4) then gives two of Maxwell's equations, 



dB 

V ■ B = and — = -V x E (6.7) 
dt v ; 
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These remain true even in the presence of electric sources. Meanwhile, the equations 
of motion give the remaining two Maxwell equations, 



dE 

V -E = and - = Vx5 (6.8) 
dt v ; 

As we will see shortly, in the presence of charged matter these equations pick up extra 
terms on the right-hand side. 

6.1.1 Gauge Symmetry 

The massless vector field has 4 components, which would naively seem to tell us that 
the gauge field has 4 degrees of freedom. Yet we know that the photon has only two 
degrees of freedom which we call its polarization states. How are we going to resolve 
this discrepancy? There are two related comments which will ensure that quantizing 
the gauge field gives rise to 2 degrees of freedom, rather than 4. 

• The field Aq has no kinetic term Aq in the Lagrangian: it is not dynamical. This 
means that if we are given some initial data A\ and A% at a time to, then the field 
Aq is fully determined by the equation of motion V • E = which, expanding out, 
reads 

V 2 A + V • — = (6.9) 

This has the solution 

So A Q is not independent: we don't get to specify A Q on the initial time slice. It 
looks like we have only 3 degrees of freedom in A^ rather than 4. But this is still 
one too many. 

• The Lagrangian (6.3) has a very large symmetry group, acting on the vector 
potential as 

A^x) -> A^x) + d^x) (6.11) 

for any function A(x). We'll ask only that X(x) dies off suitably quickly at spatial 
x — > oo. We call this a gauge symmetry. The field strength is invariant under the 
gauge symmetry: 

-> d„{A v + d v \) ~ d v {A» + d M A) = (6.12) 
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So what are we to make of this? We have a theory with an infinite number of 
symmetries, one for each function X(x). Previously we only encountered symme- 
tries which act the same at all points in spacetime, for example ip — > e ta ip for a 
complex scalar field. Noether's theorem told us that these symmetries give rise 
to conservation laws. Do we now have an infinite number of conservation laws? 



The answer is no! Gauge symmetries have a very different interpretation than 
the global symmetries that we make use of in Noether's theorem. While the 
latter take a physical state to another physical state with the same properties, 
the gauge symmetry is to be viewed as a redundancy in our description. That is, 
two states related by a gauge symmetry are to be identified: they are the same 
physical state. (There is a small caveat to this statement which is explained in 
Section 6.3.1). One way to see that this interpretation is necessary is to notice 
that Maxwell's equations are not sufficient to specify the evolution of A^. The 
equations read, 



[ri,u{d p d p )-d,d u ] A» = 



(6.13) 



But the operator [rj flI/ (d p d p ) — c^ct] is not invertible: it annihilates any function of 
the form d^X. This means that given any initial data, we have no way to uniquely 
determine A p at a later time since we can't distinguish between A^ and A^ + d^X. 
This would be problematic if we thought that A^ is a physical object. However, 
if we're happy to identify A p and A p + d^X as corresponding to the same physical 
state, then our problems disappear. 

Since gauge invariance is a redundancy of the system, 
we might try to formulate the theory purely in terms of 
the local, physical, gauge invariant objects E and B. This 
is fine for the free classical theory: Maxwell's equations 
were, after all, first written in terms of E and B. But it is 
not possible to describe certain quantum phenomena, such 
as the Aharonov-Bohm effect, without using the gauge 
potential A p . We will see shortly that we also require the 
gauge potential to describe classically charged fields. To 
describe Nature, it appears that we have to introduce quantities A p that we can never 
measure. 

The picture that emerges for the theory of electromagnetism is of an enlarged phase 
space, foliated by gauge orbits as shown in the figure. All states that lie along a given 




Figure 29: 
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line can be reached by a gauge transformation and are identified. To make progress, 
we pick a representative from each gauge orbit. It doesn't matter which representative 
we pick — after all, they're all physically equivalent. But we should make sure that we 
pick a "good" gauge, in which we cut the orbits. 

Different representative configurations of a physical state are called different gauges. 
There are many possibilities, some of which will be more useful in different situations. 
Picking a gauge is rather like picking coordinates that are adapted to a particular 
problem. Moreover, different gauges often reveal slightly different aspects of a problem. 
Here we'll look at two different gauges: 

• Lorentz Gauge: d^A^ = 

To see that we can always pick a representative configuration satisfying d^A^ = 0, 
suppose that we're handed a gauge field A'^ satisfying d^A'Y = f(x). Then we 
choose A^ = v4' M + <9 M A, where 

d^X = -f (6.14) 

This equation always has a solution. In fact this condition doesn't pick a unique 
representative from the gauge orbit. We're always free to make further gauge 
transformations with d^d^X = 0, which also has non-trivial solutions. As the 
name suggests, the Lorentz gauge 3 has the advantage that it is Lorentz invariant. 

• Coulomb Gauge: V • A = 

We can make use of the residual gauge transformations in Lorentz gauge to pick 
V • A = 0. (The argument is the same as before). Since A$ is fixed by (6.10), we 
have as a consequence 

A = (6.15) 

(This equation will no longer hold in Coulomb gauge in the presence of charged 
matter). Coulomb gauge breaks Lorentz invariance, so may not be ideal for some 
purposes. However, it is very useful to exhibit the physical degrees of freedom: 
the 3 components of A satisfy a single constraint: V ■ A = 0, leaving behind just 
2 degrees of freedom. These will be identified with the two polarization states of 
the photon. Coulomb gauge is sometimes called radiation gauge. 

3 Named after Lorenz who had the misfortune to be one letter away from greatness. 
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6.2 The Quantization of the Electromagnetic Field 

In the following we shall quantize free Maxwell theory twice: once in Coulomb gauge, 
and again in Lorentz gauge. We'll ultimately get the same answers and, along the way, 
see that each method comes with its own subtleties. 

The first of these subtleties is common to both methods and comes when computing 
the momentum 7r M conjugate to A^, 

^ = -^=0 
dA 

BC 

7r* = —7- — —F 0i = (6.16) 
dAi 

so the momentum 7r° conjugate to A vanishes. This is the mathematical consequence of 
the statement we made above: A is not a dynamical field. Meanwhile, the momentum 
conjugate to A, is our old friend, the electric field. We can compute the Hamiltonian, 



H = J d A x ix l Ai - C 

= J d 3 x \E ■ E+\B ■ B - A)(V • E) (6.17) 

So A acts as a Lagrange multiplier which imposes Gauss' law 

V-£ = (6.18) 

— * 

which is now a constraint on the system in which A are the physical degrees of freedom. 
Let's now see how to treat this system using different gauge fixing conditions. 

6.2.1 Coulomb Gauge 

In Coulomb gauge, the equation of motion for A is 

d li d"l=Q (6.19) 

which we can solve in the usual way, 



d 3 p 

n = J i 

with pi = \p\ 2 . The constraint V • A = tells us that £ must satisfy 



fp = (6.21) 
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which means that £ is perpendicular to the direction of motion p. We can pick £ (p) to 
be a linear combination of two orthonormal vectors e r , r = 1,2, each of which satisfies 
trip) • P — and 

£(p) • e a (p) = <f„ r,s = l,2 (6.22) 

These two vectors correspond to the two polarization states of the photon. It's worth 
pointing out that you can't consistently pick a continuous basis of polarization vectors 
for every value of p because you can't comb the hair on a sphere. But this topological 
fact doesn't cause any complications in computing QED scattering processes. 

To quantize we turn the Poisson brackets into commutators. Naively we would write 

[A t (x), A J (y)] = [E\x),E^(y)] = 

[A l {x),Ei{y)]=i5{5^\x-y) (6.23) 

But this can't quite be right, because it's not consistent with the constraints. We 
still want to have V • A = V • E = 0, now imposed on the operators. But from the 
commutator relations above, we see 

[V • A(x),V ■ E(y)\ = tV 2 5 {3 \x-y) ^ (6.24) 

What's going on? In imposing the commutator relations (6.23) we haven't correctly 
taken into account the constraints. In fact, this is a problem already in the classical 
theory, where the Poisson bracket structure is already altered 4 . The correct Poisson 
bracket structure leads to an alteration of the last commutation relation, 



[A t (x),E J (y)] = 




(6.25) 



To see that this is now consistent with the constraints, we can rewrite the right-hand 
side of the commutator in momentum space, 

[A(x), E M ] = i J (*« - P) ^ ( 6 - 26 ) 

which is now consistent with the constraints, for example 

frMx), em\ = ^ / - p) to e * (aH0 = ( 6 - 27 ) 

4 For a nice discussion of the classical and quantum dynamics of constrained systems, see the small 
book by Paul Dirac, "Lectures on Quantum Mechanics" 
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We now write A in the usual mode expansion, 



A{x) 



E(x) 



d 3 p 1 _ 



d 3 p 



-i, 




'V 2 : j 

where, as before, the polarization vectors satisfy 

e r (p) -p = and e r (p) ■ e s {p) = 5 : 



a r p e ip - x + a'Je 



a r ^ s - a r Je~ if£ 



(6.28) 



(6.29) 



It is not hard to show that the commutation relations (6.25) are equivalent to the usual 
commutation relations for the creation and annihilation operators, 







[a£, a s J] = (27tf5 rs 6®(p- q) (6.30) 
where, in deriving this, we need the completeness relation for the polarization vectors, 



r=l 



pip] 

w 



(6.31) 



You can easily check that this equation is true by acting on both sides with a basis of 
vectors (ei (p) , e 2 (p) , p) . 

We derive the Hamiltonian by substituting (6.28) into (6.17). The last term vanishes 
in Coulomb gauge. After normal ordering, and playing around with e r polarization 
vectors, we get the simple expression 

2 



H 



d 3 p 
(2^p 



|pi£ 



r f r 



(6.32) 



r=l 



The Coulomb gauge has the advantage that the physical degrees of freedom are man- 
ifest. However, we've lost all semblance of Lorentz invariance. One place where this 
manifests itself is in the propagator for the fields A^x) (in the Heisenberg picture). In 
Coulomb gauge the propagator reads 

d 4 p i 



Dg(x-y) = <0|T^(x)^(i/)|0> 



(2vr) 4 



PiPj 



-ip-(x-y) 



(6.33) 



The tr superscript on the propagator refers to the "transverse" part of the photon. 
When we turn to the interacting theory, we will have to fight to massage this propagator 
into something a little nicer. 
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6.2.2 Lorentz Gauge 

We could try to work in a Lorentz invariant fashion by imposing the Lorentz gauge 
condition d^A^ = 0. The equations of motion that follow from the action are then 

d^A" = (6.34) 

Our approach to implementing Lorentz gauge will be a little different from the method 
we used in Coulomb gauge. We choose to change the theory so that (6.34) arises directly 
through the equations of motion. We can achieve this by taking the Lagrangian 

C = -\F, v F>"- l -{d,A»Y (6.35) 
The equations of motion coming from this action are 

d^ y + d u (d fjL A^) = d^A u = (6.36) 
(In fact, we could be a little more general than this, and consider the Lagrangian 



L = -\F vlt F* w --^(<Vn 2 (6-37) 
la 



with arbitrary a and reach similar conclusions. The quantization of the theory is 
independent of a and, rather confusingly, different choices of a are sometimes also 
referred to as different "gauges" . We will use a = 1 , which is called "Feynman gauge" . 
The other common choice, a = 0, is called "Landau gauge".) 

Our plan will be to quantize the theory (6.36), and only later impose the constraint 
d^A^ = in a suitable manner on the Hilbert space of the theory. As we'll see, we will 
also have to deal with the residual gauge symmetry of this theory which will prove a 
little tricky. At first, we can proceed very easily, because both 7r° and tt 1 are dynamical: 

OA, 
3C 

w i = = &A° - A 1 (6.38) 
dAi 

Turning these classical fields into operators, we can simply impose the usual commu- 
tation relations, 

iA fl (x),A u (y)] = K(x),7r»(y)] = 
[A,{x)^M}=^^^\x-y) (6.39) 
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and we can make the usual expansion in terms of creation and annihilation operators 
and 4 polarization vectors (e At ) A , with A = 0, 1, 2, 3. 



Iwf\f^ f^ )X ® H ^ ~ °i te "*1 (6 - 40) 



Note that the momentum 7r M comes with a factor of rather than the familiar (— i) 
that we've seen so far. This can be traced to the fact that the momentum (6.38) for the 
classical fields takes the form 7r M = — + . . .. In the Heisenberg picture, it becomes 
clear that this descends to in the definition of momentum. 

There are now four polarization 4- vectors e x (p), instead of the two polarization 3- 
vectors that we met in the Coulomb gauge. Of these four 4-vectors, we pick e° to be 
timelike, while e 1 ' 2,3 are spacelike. We pick the normalization 

6 A • e x ' = V xx ' (6.41) 

which also means that 

M A i^f V\x> = V„u (6.42) 

The polarization vectors depend on the photon 4-momentum p = (\p\,p). Of the two 
spacelike polarizations, we will choose e 1 and e 2 to lie transverse to the momentum: 

e 1 .p = e 2 -p = (6.43) 

The third vector e 3 is the longitudinal polarization. For example, if the momentum lies 
along the x 3 direction, so p ~ (1, 0, 0, 1), then 

'■(!)• '"-(:)• '"-(!) •'-(!) (6 ' 44) 

For other 4-momenta, the polarization vectors are the appropriate Lorentz transforma- 
tions of these vectors, since (6.43) are Lorentz invariant. 

We do our usual trick, and translate the field commutation relations (6.39) into those 
for creation and annihilation operators. We find [a~, a^\ = [at , at ' ] = and 

= (2n) 3 5^(p- q) (6.45) 
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The minus signs here are odd to say the least! For spacelike A = 1,2,3, everything 
looks fine, 

[4, «f] = (2?r) 3 5 (3) (p - q) A, A' = 1, 2, 3 (6.46) 

But for the timelike annihilation and creation operators, we have 

[a%a^] = -(2n) 3 5^(p-q) (6.47) 

This is very odd! To see just how strange this is, we take the Lorentz invariant vacuum 
|0) defined by 

4 |0) = (6.48) 

Then we can create one-particle states in the usual way, 

\p,X) = a y\0) (6.49) 

For spacelike polarization states, A = 1,2,3, all seems well. But for the timelike 
polarization A = 0, the state \p, 0) has negative norm, 

(p, 0| q, 0) = (0| a° P a° f |0> = -(2vr) 3 8®@- q) (6.50) 

Wtf? That's very very strange. A Hilbert space with negative norm means negative 
probabilities which makes no sense at all. We can trace this negative norm back to the 
wrong sign of the kinetic term for A in our original Lagrangian: C = + ^A 2 — ^Aq + . . .. 

At this point we should remember our constraint equation, d^A^ 1 = 0, which, until 
now, we've not imposed on our theory. This is going to come to our rescue. We will see 
that it will remove the timelike, negative norm states, and cut the physical polarizations 
down to two. We work in the Heisenberg picture, so that 

d^A" = (6.51) 

makes sense as an operator equation. Then we could try implementing the constraint 
in the quantum theory in a number of different ways. Let's look at a number of 
increasingly weak ways to do this 

• We could ask that d^A^ = is imposed as an equation on operators. But this 
can't possibly work because the commutation relations (6.39) won't be obeyed 
for 7T° = — d u A^. We need some weaker condition. 



- 133 - 



• We could try to impose the condition on the Hilbert space instead of directly 
on the operators. After all, that's where the trouble lies! We could imagine that 
there's some way to split the Hilbert space up into good states |^) and bad states 
that somehow decouple from the system. With luck, our bad states will include 
the weird negative norm states that we're so disgusted by. But how can we define 
the good states? One idea is to impose 

d^\ty} = (6.52) 

on all good, physical states \^/). But this can't work either! Again, the condition 
is too strong. For example, suppose we decompose A^x) = A+(x) + A~(x) with 

w^wfimt: 14 ^" (6 ' 53) 

Then, on the vacuum A+ 1 0) = automatically, but d^A' |0) ^ 0. So not even 
the vacuum is a physical state if we use (6.52) as our constraint 

• Our final attempt will be the correct one. In order to keep the vacuum as a good 
physical state, we can ask that physical states are defined by 

0MJ |$) = (6.54) 

This ensures that 



(V'ld^lty) = (6.55) 

so that the operator d^A^ has vanishing matrix elements between physical states. 
Equation (6.54) is known as the Gupta-Bleuler condition. The linearity of the 
constraint means that the physical states span a physical Hilbert space % p h ys - 

So what does the physical Hilbert space "H p h ys look like? And, in particular, have we 
rid ourselves of those nasty negative norm states so that H p h ys has a positive definite 
inner product defined on it? The answer is actually no, but almost! 

Let's consider a basis of states for the Fock space. We can decompose any element 
of this basis as |^) = \ipx) \<f>), where \iPt) contains only transverse photons, created by 
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a^ 2 \ while |0) contains the timelike photons created by aV and longitudinal photons 
created by . The Gupta-Bleuler condition (6.54) requires 

(4-4)10 = (6.56) 

This means that the physical states must contain combinations of timelike and longi- 
tudinal photons. Whenever the state contains a timelike photon of momentum p, it 
must also contain a longitudinal photon with the same momentum. In general \<ft) will 
be a linear combination of states \<p n ) containing n pairs of timelike and longitudinal 
photons, which we can write as 

oo 

|0)=^C n |0 n ) (6.57) 

n=0 

where |0o) = |0) is simply the vacuum. It's not hard to show that although the condition 
(6.56) does indeed decouple the negative norm states, all the remaining states involving 
timelike and longitudinal photons have zero norm 

(4>m\ 4>n) = S n0 S m0 (6.58) 

This means that the inner product on "H p h ys is positive semi-definite. Which is an 
improvement. But we still need to deal with all these zero norm states. 

The way we cope with the zero norm states is to treat them as gauge equivalent 
to the vacuum. Two states that differ only in their timelike and longitudinal photon 
content, |0 n ) with n > 1 are said to be physically equivalent. We can think of the gauge 
symmetry of the classical theory as descending to the Hilbert space of the quantum 
theory. Of course, we can't just stipulate that two states are physically identical unless 
they give the same expectation value for all physical observables. We can check that 
this is true for the Hamiltonian, which can be easily computed to be 




(6.59) 



But the condition (6.56) ensures that 44 1^ = ^ a V a °p 1^ so ^ e con ^ r i" 
butions from the timelike and longitudinal photons cancel amongst themselves in the 
Hamiltonian. This also renders the Hamiltonian positive definite, leaving us just with 
the contribution from the transverse photons as we would expect. 

In general, one can show that the expectation values of all gauge invariant operators 
evaluated on physical states are independent of the coefficients C n in (6.57). 
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Propagators 

Finally, it's a simple matter to compute the propagator in Lorentz gauge. It is given 
by 

{V\TA»{x)A v {y)\V) = J 0ji^- e ^ ipix ~ y) (6-60) 

This is a lot nicer than the propagator we found in Coulomb gauge: in particular, it's 
Lorentz invariant. We could also return to the Lagrangian (6.37). Had we pushed 
through the calculation with arbitrary coefficient a, we would find the propagator, 

(0| T A,(x)A„(y) |0> = J ^ + (a - 1)^) e -*(-v) (6.61) 

6.3 Coupling to Matter 

Let's now build an interacting theory of light and matter. We want to write down 
a Lagrangian which couples to some matter fields, either scalars or spinors. For 
example, we could write something like 

C = -\F, u F^-fA fl (6.62) 

where j M is some function of the matter fields. The equations of motion read 

d,F^ = f (6.63) 

so, for consistency, we require 

d^f = (6.64) 

In other words, must be a conserved current. But we've got lots of those! Let's look 
at how we can couple two of them to electromagnetism. 

6.3.1 Coupling to Fermions 

The Dirac Lagrangian 

C = tp(i $ — m)ip (6.65) 

has an internal symmetry ifi — > e~ lot ip and — > e +m ?/>, with a 6 R. This gives rise to 
the conserved current jy = ifjj^ip. So we could look at the theory of electromagnetism 
coupled to fermions, with the Lagrangian, 

C = -±F^F» U + ip(i0- m)%l) - e^A^ (6.66) 
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where we've introduced a coupling constant e. For the free Maxwell theory, we have 
seen that the existence of a gauge symmetry was crucial in order to cut down the 
physical degrees of freedom to the requisite 2. Does our interacting theory above still 
have a gauge symmetry? The answer is yes. To see this, let's rewrite the Lagrangian 

as 

C = -\F^ U F» V + i,{ilj)-m)i> (6.67) 

where D^ip = d^ip + ieA^tp is called the covariant derivative. This Lagrangian is 
invariant under gauge transformations which act as 

— >■ + d^X and ijj ->• e" ie > (6.68) 

for an arbitrary function X(x). The tricky term is the derivative acting on ip, since this 
will also hit the e~ ieX piece after the transformation. To see that all is well, let's look 
at how the covariant derivative transforms. We have 

= + ieA^ 

-+ «9 M (e" ie V) + *e(A^ + d M A) (e~ ie V) 

= e~ ieX D^ (6.69) 

so the covariant derivative has the nice property that it merely picks up a phase under 
the gauge transformation, with the derivative of e~ ieX cancelling the transformation 
of the gauge field. This ensures that the whole Lagrangian is invariant, since ip — > 

Electric Charge 

The coupling e has the interpretation of the electric charge of the particle. This 
follows from the equations of motion of classical electromagnetism d^F^ = j u : we 
know that the j° component is the charge density. We therefore have the total charge 
Q given by 

Q = eJ**WK*) (6.™) 
After treating this as a quantum equation, we have 

which is the number of particles, minus the number of antiparticles. Note that the 
particle and the anti-particle are required by the formalism to have opposite electric 
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charge. For QED, the theory of light interacting with electrons, the electric charge 
is usually written in terms of the dimensionless ratio a, known as the fine structure 
constant 



AttHc 137 
Setting H = c = 1, we have e = \f^Ka ~ 0.3. 

There's a small subtlety here that's worth elaborating on. I stressed that there's a 
radical difference between the interpretation of a global symmetry and a gauge symme- 
try. The former takes you from one physical state to another with the same properties 
and results in a conserved current through Noether's theorem. The latter is a redun- 
dancy in our description of the system. Yet in electromagnetism, the gauge symmetry 
seems to lead to a conservation law, namely the conservation of electric 
charge. This is because among the infinite number of gauge symmetries parameterized 
by a function A(x), there is also a single global symmetry: that with X(x) = constant. 
This is a true symmetry of the system, meaning that it takes us to another physical 
state. More generally, the subset of global symmetries from among the gauge symme- 
tries are those for which A(x) — > a = constant as i — > oo. These take us from one 
physical state to another. 

Finally, let's check that the 4x4 matrix C that we introduced in Section 4.5 really 
deserves the name "charge conjugation matrix". If we take the complex conjugation of 
the Dirac equation, we have 

(iy*^ _ efAp - m)il> = (-i(-f)*d^ - e{YT^ - m)^ = 

Now using the defining equation C^j^C = —(7^)*, and the definition ip^ = Ctp*, we 
see that the charge conjugate spinor satisfies 

(i^d^ + e 7 M M - m)^ (c) = (6.73) 

So we see that the charge conjugate spinor satisfies the Dirac equation, but with 
charge — e instead of +e. 

6.3.2 Coupling to Scalars 

For a real scalar field, we have no suitable conserved current. This means that we can't 
couple a real scalar field to a gauge field. 



(6.72) 
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Let's now consider a complex scalar field <p. (For this section, I'll depart from our 
previous notation and call the scalar field ip to avoid confusing it with the spinor). We 
have a symmetry cp — > e~ ia (p. We could try to couple the associated current to the 
gauge field, 

Ant = -i({dp<f)tp - v*d^)A» (6.74) 
But this doesn't work because 

• The theory is no longer gauge invariant 

• The current that we coupled to depends on d^tp. This means that if we 
try to compute the current associated to the symmetry, it will now pick up a 
contribution from the j^A^ term. So the whole procedure wasn't consistent. 

We solve both of these problems simultaneously by remembering the covariant deriva- 
tive. In this scalar theory, the combination 

V^ip = d^tp + ieA^ip (6.75) 

again transforms as V^ip — > e~ ieX V^(p under a gauge transformation A^ — )■ A^ + d^X 
and ip — > e~ teX ip. This means that we can construct a gauge invariant action for a 
charged scalar field coupled to a photon simply by promoting all derivatives to covariant 
derivatives 

C = - l -F^ + V^V»ip-m 2 \ip\ 2 (6.76) 

In general, this trick works for any theory. If we have a U(l) symmetry that we wish to 
couple to a gauge field, we may do so by replacing all derivatives by suitable covariant 
derivatives. This procedure is known as minimal coupling. 

6.4 QED 

Let's now work out the Feynman rules for the full theory of quantum electrodynamics 
(QED) - the theory of electrons interacting with light. The Lagrangian is 

C = -^F^F» U + 4>{iIp-m)iP (6.77) 

where = + ieA^. 

The route we take now depends on the gauge choice. If we worked in Lorentz gauge 
previously, then we can jump straight to Section 6.5 where the Feynman rules for QED 
are written down. If, however, we worked in Coulomb gauge, then we still have a bit of 
work in front of us in order to massage the photon propagator into something Lorentz 
invariant. We will now do that. 
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In Coulomb gauge V • A = 0, the equation of motion arising from varying A is now 

-V 2 A) = e^V = ej° (6.78) 

which has the solution 

M 2,t) = e J M (6.79) 
In Coulomb gauge we can rewrite the Maxwell part of the Lagrangian as 



^Maxwell 



d 3 x \E 2 - \B 2 



d 3 x \{A + VA Q ) 2 -\B 2 
d 3 x \A 2 + \{V A f - \B 2 



(6.80) 



where the cross-term has vanished using V ■ A = 0. After integrating the second term 
by parts and inserting the equation for A , we have 



■> Maxwell 



d 3 x 



\A 2 -\B 2 + - 



>3 / jo(x)j (x') 



d x 



Aty\x — x' 



(6.81) 



We find ourselves with a nonlocal term in the action. This is exactly the type of 
interaction that we boasted in Section 1.1.4 never arises in Nature! It appears here as 
an artifact of working in Coulomb gauge: it does not mean that the theory of QED is 
nonlocal. For example, it wouldn't appear if we worked in Lorentz gauge. 

We now compute the Hamiltonian. Changing notation slightly from previous chap- 
ters, we have the conjugate momenta, 



n 

71> 



dC 
dA 

dc 

dip 



A 



(6.82) 



which gives us the Hamiltonian 

' d 3 x 



\A 2 + \B 2 + iPi-irfdi + m)ip -ej-A + 



e 
~2 



, f f(S)f(x') 



d A x 



4ir\x — x' 



where j = ip^ip and j° = ip~f°ijj. 
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6.4.1 Naive Feynman Rules 

We want to determine the Feynman rules for this theory. For fermions, the rules are 
the same as those given in Section 5. The new pieces are: 

• We denote the photon by a wavy line. Each end of the line comes with an i,j = 
1, 2, 3 index telling us the component of A. We calculated the transverse photon 

% ( Pip ' 

propagator in (6.33): it is v/vn/wv and contributes Df- = — I — 

J p 2 + ie \ \p\ 2 

• The vertex ^^vwwv contributes —ie^j 1 . The index on 7* contracts with the 
index on the photon line. 

• The non-local interaction which, in position space, is given by 

2(e 7 °) 2 5(x - y°) 





contributes a factor of 



4ir\x — y\ 



These Feynman rules are rather messy. This is the price we've paid for working in 
Coulomb gauge. We'll now show that we can massage these expressions into something 
much more simple and Lorentz invariant. Let's start with the offending instantaneous 
interaction. Since it comes from the Aq component of the gauge field, we could try to 
redefine the propagator to include a D 00 piece which will capture this term. In fact, it 
fits quite nicely in this form: if we look in momentum space, we have 

S(x° - y°) f d A p e^-w) 



47r\x-y\ J (2tt) 4 \p\ 2 

so we can combine the non-local interaction with the transverse photon propagator by 
defining a new photon propagator 



DM 



p 2 + ie 



i 

+ p2 /i ' U = 

otherwise 



With this propagator, the wavy photon line now carries a /x, v = 0, 1, 2, 3 index, with 
the extra \x = component taking care of the instantaneous interaction. We now need 
to change our vertex slightly: the —ieY above gets replaced by —ie^ which correctly 
accounts for the (e7°) 2 piece in the instantaneous interaction. 
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The D 00 piece of the propagator doesn't look a whole lot different from the transverse 
photon propagator. But wouldn't it be nice if they were both part of something more 
symmetric! In fact, they are. We have the following: 



Claim: We can replace the propagator D^ u {p) with the simpler, Lorentz invariant 
propagator 

D„ u (p) = -i^ (6.85) 

Proof: There is a general proof using current conservation. Here we'll be more pedes- 
trian and show that we can do this for certain Feynman diagrams. In particular, we 
focus on a particular tree-level diagram that contributes to e~e~ — > e~e~ scattering, 

r 

~ e 2 [u(p')Yn(p)} D^(k) W)Yu{q)\ (6.86) 




where k = p — p' = q' — q. Recall that u{p) satisfies the equation 

(tf-m)u(p) = (6.87) 

Let's define the spinor contractions = u(p')'y tJ, u(p) and (3 V = u(q')^ v u(q). Then 
since k = p — p' = q' — q, we have 

k/iCe^ = u(p')(tf — p'^uip) = u(p')(m — m)u{p) = (6.88) 

and, similarly, k u (3 u = 0. Using this fact, the diagram can be written as 

a ■ (3 _ k 2 a /3 « 0j g 
k 2 k 2 \k\ 2 \k\ 2 

a- 8 1 



k 2 k 2 \k\ 2 



(k — k) a f3 



-±«./3 = aW_!^)^ (6 . 89) 
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which is the claimed result. You can similarly check that the same substitution is legal 
in the diagram 




- e 2 [v(q)YumD^(k)[u(f)Yv(q')} (6.90) 

In fact, although we won't show it here, it's a general fact that in every Feynman dia- 
gram we may use the very nice, Lorentz invariant propagator D^ v = —irj^/p 2 . □ 

Note: This is the propagator we found when quantizing in Lorentz gauge (using the 
Feynman gauge parameter). In general, quantizing the Lagrangian (6.37) in Lorentz 
gauge, we have the propagator 

2V = ~(*W + (a-l)^) (6-91) 

Using similar arguments to those given above, you can show that the p^Pu/p 2 term 
cancels in all diagrams. For example, in the following diagrams the p^Pv piece of the 
propagator contributes as 

~ u(p')Yu(p) k^ = u{p')(J- p")u(p) = 

v{p)Yu{q)k^ = u{p){p'+ i')u(q) = (6.92) 
6.5 Feynman Rules 

Finally, we have the Feynman rules for QED. For vertices and internal lines, we write 





Photon Propagator: v/X/WX/V 



Fermion Propagator: ^_ 



p l + xe 
i( 4 + ml 



p2 _ m 2 _|_ ^ e 
For external lines in the diagram, we attach 

• Photons: We add a polarization vector e^ n /e^ ut for incoming/outgoing photons. 
In Coulomb gauge, e° = and e • p = 0. 

• Fermions: We add a spinor u r (p)/u r (p) for incoming/outgoing fermions. We add 
a spinor v r (p)/v r (p) for incoming/outgoing anti-fermions. 
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6.5.1 Charged Scalars 

"Pauli asked me to calculate the cross section for pair creation of scalar 
particles by photons. It was only a short time after Bethe and Heitler 
had solved the same problem for electrons and positrons. I met Bethe 
in Copenhagen at a conference and asked him to tell me how he did the 
calculations. I also inquired how long it would take to perform this task; 
he answered, "It would take me three days, but you will need about three 
weeks." He was right, as usual; furthermore, the published cross sections 
were wrong by a factor of four." 



The interaction terms in the Lagrangian for charged scalars come from the covariant 
derivative terms, 



C = V»i> = d^d^i) - ieA^d^i) - ipd^) + e 2 A^A^^ (6.93) 



This gives rise to two interaction vertices. But the cubic vertex is something we haven't 
seen before: it contains kinetic terms. How do these appear in the Feynman rules? 
After a Fourier transform, the derivative term means that the interaction is stronger 
for fermions with higher momentum, so we include a momentum factor in the Feynman 
rule. There is also a second, "seagull" graph. The two Feynman rules are 



The factor of two in the seagull diagram arises because of the two identical particles 
appearing in the vertex. (It's the same reason that the 1/4! didn't appear in the 
Feynman rules for <j) A theory). 

6.6 Scattering in QED 

Let's now calculate some amplitudes for various processes in quantum electrodynamics, 
with a photon coupled to a single fermion. We will consider the analogous set of 
processes that we saw in Section 3 and Section 5. We have 



Viki Weisskopf 




and 
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Electron Scattering 

Electron scattering e~e~ — > e~e~ is described by the two leading order Feynman dia- 
grams, given by 




—i[—ie) 



{u s Xp')Yu s {p)} [u r '(q'h^u r (q)} 
(j/ — p) 2 

u s '(p')^u r (q)] [■u r '(g')7 At -u £ 
(p - Q.') 2 

The overall —% comes from the —ir]^ in the propagator, which contract the indices on 
the 7-matrices (remember that it's really positive for /i, v — 1, 2, 3). 

Electron Positron Annihilation 

Let's now look at e~e + — > 27, two gamma rays. The two lowest order Feynman 
diagrams are, 




+ 




= i(— ie) v\q) 



(p — p') 2 — m 2 



+ (p- q >) 2 -m 2 ) U ^ P )e ^ q) 
Electron Positron Scattering 

For e _ e + — > e~e + scattering (sometimes known as Bhabha scattering) the two lowest 
order Feynman diagrams are 




+ 




= -i(-ief 



+ 



[u s '{p')Yu s {p)\ [v r {q)iyv r ' (<f )] 
(p — p') 2 

[v r (q)^u s (p}} [u s ' \p')^^v r \q')] 
(p + q) 2 



Compton Scattering 

The scattering of photons (in particular x-rays) off electrons e~7 — > e~7 is known 
as Compton scattering. Historically, the change in wavelength of the photon in the 
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scattering process was one of the conclusive pieces of evidence that light could behave 
as a particle. The amplitude is given by, 




e ta (q) 



+ 



u(p) 




= i(—ie) 2 u r {p') 



(p + g) 2 — m 2 



+ 



(P - <?') 



out 



This amplitude vanishes for longitudinal photons. For example, suppose e; n ~ q. Then, 
using momentum conservation p + q = p' + q', we may write the amplitude as 



iA = i(—ie) 2 u r (p'^ 



l out V 



m) 



m) 



"-out 



i(-ie) 2 u r '(p') l ont u s {p) 



(p + q) 2 — m 2 {p' — q) 2 — m 2 

2p ■ q 2p' ■ q 



u s (p) 



+ 



(p + q) 2 — m 2 {p' — q) 2 — m 



(6.94) 



where, in going to the second line, we've performed some 7-matrix manipulations, 
and invoked the fact that q is null, so q 1 4 = 0, together with the spinor equations 
(^—m)u(p) and u(p')(p" — m) = 0. We now recall the fact that q is a null vector, while 
p 2 = (p') 2 = m 2 since the external legs are on mass-shell. This means that the two 
denominators in the amplitude read (p+q) 2 —m 2 = 2p-q and (p'—q)—m 2 = —2p'-q. This 
ensures that the combined amplitude vanishes for longitudinal photons as promised. A 
similar result holds when / out ~ q' . 



Photon Scattering 

In QED, photons no longer pass through each other unimpeded. 
At one-loop, there is a diagram which leads to photon scattering. 
Although naively logarithmically divergent, the diagram is actually 
rendered finite by gauge invariance. 

Adding Muons 



A 



Figure 30: 



Adding a second fermion into the mix, which we could identify as a 
muon, new processes become possible. For example, we can now have processes such 
as e~fi~ — >■ e~yU~ scattering, and e + e~ annihilation into a muon anti-muon pair. Using 
our standard notation of p and q for incoming momenta, and p' and q' for outgoing 
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momenta, we have the amplitudes given by 




6.6.1 The Coulomb Potential 



and 




(p + qY 



(6.95) 



We've come a long way. We've understood how to compute quantum amplitudes in 
a large array of field theories. To end this course, we use our newfound knowledge to 
rederive a result you learnt in kindergarten: Coulomb's law. 

To do this, we repeat our calculation that led us to the Yukawa force in Sections 
3.5.2 and 5.7.2. We start by looking at e~e~ — > e~e~ scattering. We have 




-ie\ 



(6.96) 



/A 

Following (5.49), the non-relativistic limit of the spinor is u(p) — > y/m . This 

w 

means that the 7 piece of the interaction gives a term u s (p) , ~y u r (q) — > 2m6 rs , while 
the spatial 7*, i = 1,2,3 pieces vanish in the non-relativistic limit: u s (p)^f l u r (q) — > 0. 
Comparing the scattering amplitude in this limit to that of non-relativistic quantum 
mechanics, we have the effective potential between two electrons given by, 



U(f) 



d 3 p eW 



+ 



(6.97) 



(2tt) 3 \p\ 2 ' 4irr 

We find the familiar repulsive Coulomb potential. We can trace the minus sign that 
gives a repulsive potential to the fact that only the Aq component of the intermediate 
propagator ~ —irj^ contributes in the non-relativistic limit. 

For e~e + —> e~e + scattering, the amplitude is 




+i(—ie) 



(6.98) 
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The overall + sign comes from treating the fermions correctly: we saw the same minus 
sign when studying scattering in Yukawa theory. The difference now comes from looking 
at the non-relativistic limit. We have vj°v — > 2m, giving us the potential between 
opposite charges, 



Reassuringly, we find an attractive force between an electron and positron. The differ- 
ence from the calculation of the Yukawa force comes again from the zeroth component 
of the gauge field, this time in the guise of the 7 sandwiched between v^v — > 2m, 
rather than the vv — > —2m that we saw in the Yukawa case. 

The Coulomb Potential for Scalars 

There are many minus signs in the above calculation which somewhat obscure the 
crucial one which gives rise to the repulsive force. A careful study reveals the offending 
sign to be that which sits in front of the A Q piece of the photon propagator —ii]^ u /p 2 . 

Note that with our signature (H ), the propagating Ai have the correct sign, while 

A comes with the wrong sign. This is simpler to see in the case of scalar QED, where 
we don't have to worry about the gamma matrices. From the Feynman rules of Section 
6.5.1, we have the non-relativistic limit of scalar e~e~ scattering, 



where the non-relativistic limit in the numerator involves (p+p') ■ (<? + </) ~ (p+p')°(q + 
q')o ~ (2m) 2 and is responsible for selecting the A part of the photon propagator rather 
than the Ai piece. This shows that the Coulomb potential for spin particles of the 
same charge is again repulsive, just as it is for fermions. For e~e + scattering, the 
amplitude picks up an extra minus sign because the arrows on the legs of the Feynman 
rules in Section 6.5.1 are correlated with the momentum arrows. Flipping the arrows 
on one pair of legs in the amplitude introduces the relevant minus sign to ensure that 
the non-relativistic potential between e~e + is attractive as expected. 




(6.99) 
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6.7 Afterword 



In this course, we have laid the foundational framework for quantum field theory. Most 
of the developments that we've seen were already in place by the middle of the 1930s, 
pioneered by people such as Jordan, Dirac, Heisenberg, Pauli and Weisskopf 5 . 

Yet by the end of the 1930s, physicists were ready to give up on quantum field theory. 
The difficulty lies in the next terms in perturbation theory. These are the terms that 
correspond to Feynamn diagrams with loops in them, which we have scrupulously 
avoided computing in this course. The reason we've avoided them is because they 
typically give infinity! And, after ten years of trying, and failing, to make sense of this, 
the general feeling was that one should do something else. This from Dirac in 1937, 

Because of its extreme complexity, most physicists will be glad to see the 
end of QED 

But the leading figures of the day gave up too soon. It took a new generation of postwar 
physicists — people like Schwinger, Feynman, Tomonaga and Dyson — to return to 
quantum field theory and tame the infinities. The story of how they did that will be 
told in next term's course. 



5 For more details on the history of quantum field theory, see the excellent book "QED and the 
Men who Made it" by Sam Schweber. 
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